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CLASP-2 TRANSMEMBRANE PROTEINS 

0.0 CROSS-REFERENCES TO RELATED APPLICATIONS 

This application is a continuation-in-part application of U.S. Patent 
Application No. 09/547,276 filed April 11, 2000, which claims the benefit of U.S. 
5 Provisional Application Nos. 60/1 82,296 filed February 14, 2000, 60/176,195 filed January 
14, 2000, 60/1 70,453 filed December 13, 1999, 60/162,498 filed October 29, 1999, 
60/160,860 filed October 21, 1999, 60/134,118 filed May 14, 1999, 60/134,117 filed May 14, 
1999, 60/134,1 14 filed May 14, 1999, and 60/129,171 filed April 14, 1999, the disclosures of 
which are incorporated by reference. 

10 1 .0 FIELD OF THE INVENTION 

The present invention relates to molecules expressed in cells of the immune 

* 

system. In particular, the invention relates to a transmembrane protein that contains certain 
classical cadherin characteristics. 

2.0 BACKGROUND OF THE INVENTION 

1 5 The generation of an immune response against an antigen is carried out by a 

number of distinct immune cell types that work in concert within the context of a particular 
antigen. The helper T cell (Th) plays a pivotal role to coordinate two types of antigen-specific 
immune response; z.e., cellular and humoral immune response. Recognition of antigen by T 
cells requires the formation of a specialized junction between the T cell and the antigen- 

20 presenting cell (APC) called the "immulogical synapse" (Dustin, et a/., 1998, Cell 94: 667- 

677). The immune synapse orchestrates recruitment and exclusion of specific proteins from 

the contact ^rea by an unknown mechanism and is thought to be initiated by T-cell antigen 

.**•* 

receptor (TCR) recognition of peptides bound to MHC molecules (antigen) (Monk; et al. 

1998, Nature 395: 82). However, the. low affinity of the TCR for antigen as well as limited 

25 number of ligands makes it unlikely that TCR: antigen interaction alone is sufficient to drive 

the formation of the immunological synapse (Matsui et al. 9 1994, Proc. Natl. Acad, Sci. 

U.S.A. 91: 12861-12866). 

Costimulatory molecules such as CD4, ICAM-1, LFA-1, CD28, CD2 have 

been proposed to stabilize the cell-cell contact (Dustin, et al, 1999, Science 283: 649). 

However, since these molecules are recruited to the synapse after activation they cannot 
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account for the high specificity and avidity during the early phases of T-cell antigen 
recognition. Recent work demonstrated that a portion of the T cell surface at the leading 
edge is specialized to mediate the early phases of synapse formation (Negulescu, et aL, 1996, 
Immunity 4: 421-430). Such a specialization must be a pre- formed structure containing'cell 

5 surface adhesion proteins (ectodomains) to augment TCR engagement and corresponding 
cytoplasmic portions (endodomains) to transduce signals and bind cytoskeleton to maintain 
structural/functional polarity. 

The ectodomain of the pre-formed synapse or "immune gateway*' was recently 
discovered and is created in part by CLASP-1 (U.S.S.N. 09/41 1,328, filed October 1, 1999; 

10 PCTYUS 99/22996). In addition to cadherin motifs, CLASP-1 also contains a CRK-SH3 
binding domain, tyrosine phosphorylation sites, and coiled/coil domains suggesting direct 
interaction with cytoskeleton and regulation by adaptor molecules such as CRK. The 
CLASP-1 transcript is present in lymphoid organs and neural tissue, and the protein is 
expressed by T and B lymphocytes and macrophages in the MOMA-1 subregion of the 

1 5 marginal zone of the spleen, an area known to be important in T: B cell interaction. CLASP- 
1 staining of individual T and B cells exhibits a preactivation structural polarity, being 
organized as a "ball" or "cap" structure in B cells, and forming a "ring", "ball" or "cap" 
structure in T cells. The placement of these structures is adjacent to the microtubule- 
organizing center ("MTOC"). CLASP-1 antibody staining indicates that CLASP-1 is at the 

20 interface of T-B cell conjugates that are folly committed to differentiation. Antibodies to the 
extracellular domain of CLASP-1 also block T-B cell conjugate formation and T cell 
activation. 

3.0 SUMMARY OF THE INVENTION 

The present invention relates to a cell surface molecule, a member of a new 
25 multigeneTamily designated cadherin-like asymmetry protein(s) ("CLASP(s)"). In 

particular, it relates to a polynucleotide comprising a coding sequence for CLASP-2, a 
polynucleotide that selectively hybridizes to the complement of a CLASP-2 coding sequence, 
expression vectors containing such polynucleotides, genetically-engineered host cells 
containing such polynucleotides, CLASP-2 polypeptides, CLASP-2 fusion proteins, 
30 therapeutic compositions, CLASP-2 domain mutants, antibodies specific for CLASP-2 

polypeptides, methods for detecting the expression of CLASP-2, and methods of inhibiting an 
immune response by interfering with CLASP-2 function. A wide variety of uses are 
encompassed by the invention, including but not limited to, treatment of autoimmune 
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diseases and hypersensitivities, prevention of transplantation rejection responses^and 
augmentation of immune responsiveness in immunodeficiency states. 

In one aspect, the invention provides an isolated CLASP-2 polynucleotide that 
is: (a) a polynucleotide that has the sequence of SEQ ID NO: 1, 3, 5 or 9; (b) a polynucleotide 
5 that hybridizes under stringent hybridization conditions to (a) and encodes a polypeptide 
having the sequence of SEQ ID NO: 2, 4, 6 or 10 or an allelic variant or homologue of a 
polypeptide having the sequence of SEQ ID NO: 2, 4, 6 or 10; or (c) a polynucleotide that 
hybridizes under stringent hybridization conditions to (a) and encodes a polypeptide with at 
least 25 contiguous residues of the polypeptide of SEQ ID NO: 2, 4, 6 or 1 0; or (d) a 

10 polynucleotide that hybridizes under stringent hybridization conditions to (a) and has at least 
12 contiguous bases identical to or exactly complementary to SEQ ID NO: 1, 3, 5, or 9. In a 
related aspect, the invention provides a CLASP-2 polynucleotide wherein the polynucleotide 
encodes a polypeptide that binds to the PDZ domain of PSD95, DLG1 or neDLG. 2. In 
another related aspect, the invention provides a CLASP-2 polynucleotide wherein the 

15 polynucleotide encodes a polypeptide that has a binding affinity of at least 10 4 M" 1 for 
binding PSD95, DLGl or neDLG. 

In one aspect, the invention provides a CLASP-2 polynucleotide that encodes 
a polypeptide having the full-length sequence of SEQ ID NO: 2, 4, 6, or 10 or the cDNA 
sequence encoded by the inserts of ATCC Deposit Nos: PTA-1562, PTA-1563 and PTA- 

20 1573. 

In another aspect, the invention provides a CLASP-2 polynucleotide that 
encodes a polypeptide having the full-length sequence of Isoform 1 , Isoform 2, or Isoform 3 

(SEQ ID NO: ) or the cDNA sequence encoded by theinserts of AVC- 

PD14 (ATCC Deposit No. ) and AVC-PD 1 9 (ATCC Deposit No. ). 

25 In another aspect, the invention further provides an isolated CLASP-2 

polynucleotide comprising a nucleotide sequence that has at least 90% percent identity to 
SEQ ID NO: 1, 3, 5 or 9 as calculated using FASTA wherein said sequences are aligned so 
that highest order match between said sequences is obtained. 

The invention further provides an isolated polypeptide comprising a 
30 nucleotide sequence that has at least 90% sequence identity to SEQ ID NO: 2, 4, 6 or 10 and 
is immunologically crossreactive with SEQ ID NO: 2, 4, 6 or 10 or shares a biological 
function with native CLASP-2. 

The invention also provides vectors, such as expression vectors, comprising a 
polynucleotide sequence of the invention In other embodiments, the invention provides host 
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cells or progeny of the host cells comprising a vector of the invention. In certain 
embodiments, the host cell is a eukaryote. In other embodiments, the expression vector 
comprises a CLASP-2 polynucleotide in which the nucleotide sequence of the polynucleotide 
is operatively linked with a regulatory sequence that controls expression of the 
5 polynucleotide in a host cell. In certain embodiments, the invention provides a host cell 
comprising a CLASP-2 polynucleotide, wherein the nucleotide sequence of the 
polynucleotide is operatively linked with a regulatory sequence that controls expression of 
the polynucleotide in a host cell, or progeny of the cell. 

In another aspect, the invention further provides a CLASP-2 polynucleotide 
10 that is an antisense polynucleotide. In a preferred embodiment, the antisense polynucleotide 
is less than about 200 bases in length. In other embodiments, the invention provides an 
antisense oligonucleotide complementary to a messenger RNA comprising SEQ ID NO: 1, 3, 
5 or 9 and encoding CLASP-2, wherein the oligonucleotide inhibits the expression of 
CLASP-2. 

15 In another aspect, the invention provides an isolated DNA that encodes a 

CLASP-2 protein as shown in SEQ ED NO: 2, 4, 6 or 10. In certain embodiments, the 

CLASP-2 polynucleotide is RNA. 

The invention provides a method for producing a polypeptide comprising: (a) 

culturing the host cell containing a CLASP-2 polynucleotide under conditions such that the 
20 polypeptide is expressed; and (b) recovering the polypeptide from the cultured host cell or its 

cultured medium. 

The invention further provides an isolated CLASP-2 polypeptide encoded by a 
CLASP-2 polynucleotide. In some embodiments, the CLASP-2 polypeptide has the amino 
acid sequence of SEQ ID NO: 2, 4, 6 or 10, or a fragment thereof. In some embodiments, the 
25 isolated CLASP-2 polypeptide is cell-membrane associated. In other embodiments, the 
isolated CLASP —2 polypeptide is soluble. In other embodiments, the soluble CLASP-2 
polypeptide is fused with a heterologous polypeptide. 

The invention further provides an isolated CLASP-2 protein having the 
sequence as shown in SEQ ID NO: 2, 4, 6 or 10. In some embodiments, the invention 
30 provides a CLASP-2 protein comprising the sequence as shown in SEQ. ID. NO: 1 and 
variants thereof that are at least 95% identical to SEQ ID. NO: 2 and specifically binds a 
cytoskeletal protein. In certain embodiments the cytoskeletal protein is spectrin. 

The invention further provides an isolated antibody that specifically binds to a 
polypeptide having the amino acid sequence as shown in SEQ ID NO: 2, 4, 6 or 10, or a 
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binding fragment thereof. In some embodiments the antibody is monoclonal. In-other 
embodiments, the invention provides a hybridoma capable of secreting the antibody. 

The invention further provides a method of identifying a compound or agent 
that binds a CLASP-2 polypeptide comprising: i) contacting a CLASP-2 polypeptide with the 
5 compound or agent under conditions which allow binding of the compound to the CLASP-2 
polypeptide to form a complex and ii) detecting the presence of the complex. 

The invention further provides a method of detecting a CLASP-2 polypeptide 
in a sample, comprising: (a) contacting the sample with a CLASP-2 antibody or binding 
fragment and (b) determining whether a complex has been formed between the antibody and 
10 with CLASP-2 polypeptide. 

The invention further provides a method of detecting a CLASP-2 polypeptide 
in a sample, comprising: (a) contacting the sample with a CLASP-2 polynucleotide or a 
polynucleotide that comprises a sequence of at least 12 nucleotides and is complementary to a 
contiguous sequence of the CLASP-2 polynucleotide and (b) determining whether a 
15 hybridization complex has been formed. 

The invention further provides a method of detecting a CLASP-2 nucleotide in 
a sample, comprising: (a) using a polynucleotide that comprises a sequence of at least 12 
nucleotides and is complementary to a contiguous sequence of a CLASP-2 polynucleotide in 
an amplification process; and (b) determining whether a specific amplification product has 
20 been formed. 

The invention further provides pharmaceutical compositions comprising a 
CLASP-2 polynucleotide, a CLASP-2 polypeptide, or a CLASP-2 antibody and a 
pharmaceutically acceptable carrier. 

In one aspect, the invention provides a method of inhibiting an immune 
25 response in a cell comprising: (a) interfering with the expression of a CLASP-2 gene in the 
cell; (b) interfering with the ability of a CLASP-2 protein to mediate cell-cell interaction 
(e.g., interfering with a heterotypic and/or homotypic interaction) between CLASP-2 and an 
extracellular protein; (c) interfering with the ability of a CLASP-2 protein to bind to another 
protein. In some such methods, the cell is a T cell or a B cell. Some such methods comprise 
30 contacting the cell with an effective amount of a polypeptide which comprises the amino acid 
sequence of SEQ ID NO: 2, 4, 6 or 10 or a fragment thereof. 

In another aspect, the invention provides a method of inhibiting an immune 
response in a subject, comprising administering to the subject a therapeutically effective 
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amount of an antibody which specifically binds a polypeptide having the sequence of SEQ ID 
NO: 2, 4, 6 or 10. 

In another aspect, the invention provides a method of preventing or treating a 
CLASP-2-mediated disease comprising administering to a subject in need thereof a 
therapeutically effective amount of a CLASP-2 pharmaceutical composition. In some such 
methods, the CLASP-2-mediated disease is an autoimmune disease. 

The invention further provides a method of treating an autoimmune disease in 
a subject caused or exacerbated by increased activity of T H 1 cells consisting of administering 
a therapeutically effective amount of a CLASP-2 pharmaceutical composition to the subject. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1. Nucleotide and predicted amino acid sequence of CLASP-2 A 
cDNA. Notable protein motifs are indicated above the nucleotide sequence in bold. Potential 
initiator methionines are underscored. The notable, predicted protein motifs are: a cadherin 
cleavage site encoded by nucleotides 854-868, a cadherin ectodomain (EC) encoded by 
nucleotides 1253-1264, a transmembrane domain encoded by nucleotides 2861-2917, a coiled 
coil domain encoded by nucleotides 3579-3682, a second coiled coil domain encoded by 
nucleotides 3827-3937, and a PDZ binding motif (PBM) encoded by nucleotides 4046-4057. 

Figure 2. A. Schematic of CLASP-2 splice variants. Splice variants are 
compared to Human (h) CLASP-2A. Numbers above hCLASP-2A line diagram indicate 
where splice variations comprising deletions and insertions relative to hCLASP-2A are 
found. Abbreviations: "KIAA" KIAA1058 sequence (Genbank Accession No. AB028981). 
B. Nucleotide and predicted amino acid sequence of CLASP-2A cDNA. Notable protein 
motifs are indicated above the nucleotide sequence in bold. Exact position of insertions and 
deletions are indicated above the CLASP-2A sequence with arrows and **x", respectively. 
The nucleotide sequence of insertions schematized in FIG. 2 A are indicated above the arrow. 
The insertions and deletions are as follows (numeration refers to Human CLASP-2A 
nucleotide sequence): Nucleotides 1966-2034 are deleted in CLASP-2D. Nucleotides 2219- 
2224 are deleted in CLASP-2B. There is an insertion of 69 amino acids at nucleotide 2927 
found in CLASP-2D. The nucleotide sequence for this insertion is: 

AAGCAGTCCAGTGGGAGCCGCCCCTTCTCCCCCACAGCCATAGCGCCTGCCTGAG 
GAGGAGCCGGGGAG and encodes amino acids AVQWEPPLLPHSHSACLRRSRG (one 
letter amino acid abbreviation). This amino acid sequence encodes a putative SH3 binding 
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domain. There is another deletion at between nucleotides 301 1-3079 found in CLASP-2E. 
CLASPs 2B, 2C, 2D and 2E contain an insertion at nucleotide 3153 with the nucleotide 
sequence of: 

TGAGAGGCTGGCCCATCTGTATGACACGCTGCACCGGGCCTACAGCAAAGTGAC 
5 CGAGGTCATGCACTCGGGCCGCAGGCTTCTGGGGACCTACTTCCGGGTAGCCTTC 
. TTCGGGCAGGCAGCGCAATACCAGTTTACAGACAGTGAAACAGATGTGGAGGGA 
TT. The cniuc sequence is found in CLASP-2D and encodes amino acids 
ERLAHLYDTLHRAYSKVTEVMHSGRRLLGTYFRVAFFGQAAQYQFTDSETDVEG 
while the underline sequence is found in CLASPs 2B, 2C, and 2E and encodes amino acids 

10 ERJLAHLYDTI.HRAYSKVTEVMHSGRRLLGTYFRVAFFGQG. This amino acid 

sequence encodes a putative immunoreceptor tyrosine-based activation motif (ITAM). There 
is a two nucleotide deletion in Human -CLASP-2C found at nucleotides 3586 and 3587, 
There is an insertion of 8 nucleotides found only in Human CLASP-2D with sequence: 
CTGGGATG at nucleotide 3937. This insertion puts a stop codon into the CLASP-2D 

15 nucleotide sequence. 

Figure 3. A. Alignment of nucleotide sequences of the CLASP-2 isoforms. 
Sequences were aligned using ClustalW B. Alignment of amino acid sequence of the 
CLASP-2 isoforms. Sequences were aligned using ClustalW. One letter amino acid 
abbreviation is used. 

20 Figure 4. Expression of CLASP-2 in human cell lines and human tissues as 

determined by Northern hybridization. A CLASP-2-specific DNA fragment was generated 
by PCR from a CLASP-2 cDNA clone (HC2-5'), using primers HC2AS2 and HC2S1. The 
fragment was labeled by incorporation of radioactive 32 P dCTP. A. Expression in human 
tissues. Thejabeled DNA fragment was used as a probe on a human Multiple Tissue 

25 Northern (Clonetech MTN Blot, #7780-1). A single band is clearly detect migrating at 
approximately 7.5 kb in placenta, heart kidney and lung in the Multiple Tissue Northern. 
Slight expression is detected in liver, skeletal muscle and brain. B. Expression in 
hematopoietic cell lines. A Northern with RNA from multiple cells lines was hybridized with 
the same hCLASP-2 probe. A similarly migrating band is detected in Jurkat (T-cell derived), 

30 9D1 0 (B-cell derived) and 293 (human kidney derived) cell lines. There are multiple weaker 
bands in the 9D10 lane indicating possible splice variants of hCLASP-2, Weak expression is 
also detected in the mouse cell lines CH27 (B cell lymphoma) and 3A9 (T-cell hybridoma). 
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Since hybridization and washing were carried out at high stringency, this indicates that the 
human CLASP-2 probe may cross-react with mouse CLASP mRNA. 

Figure 5. A. Amino acid sequence of human and rat CLASP proteins. 
Sequences were aligned using ClustalW. One letter amino acid abbreviation used. Protein 
motifs are found within the labeled boxes. A indicates gaps that are placed to acquire a 
best overall alignment. Other abbreviations: "HC2A" Human CLASP-2 sequence, "K1AA" 
KIAA1058 sequence (Genbank Accession No. AB028981), "rat" TRG gene (Genbank 
Accession No. X681 01 ), "HC4" Human CLASP-4 sequence, "HCl" Human CLASP- 1 
sequence, "HC3" Human CLASP-3 sequence, "HC5" Human CLASP-5 sequence. B. 
Alignment of DOCK motifs found within the human CLASPs and compared to canonical 
DOCK motifs. Consensus amino acids found within all DOCK motifs are also indicated. 

Figure 6. A. Nucleotide and predicted amino acid sequence of CLASP-2A 
cDNA. Notable protein motifs are indicated (see FIG. 1 legend for details). Additionally, 
boundaries between exons and introns are indicated by arrows. These boundaries were 
defined by sequencing Bacterial Artificial Chromosomes (BACs) containing genomic DNA 
corresponding to CLASP-2. BACs were sequenced using primers derived from exon 
sequences corresponding to the CLASP-2 cDNA. Each exon/intron boundary is noted (as 
"Ref" with an appropriate reference number) above the cDNA sequence. The references 
contain exact nucleotide location of introns. The names and nucleotide numbers of the 
primers that were used in sequence reactions are also indicated. All nucleotide numbers refer 
to CLASP-2A cDNA sequence. As shown in the reference, not all of the sequence from 
sequencing reactions produced sequence matching the cDNA. These nucleotide sequences 
that did not match the exon sequence for CLASP-2 were considered to be intron sequences. 
B. Alignment of human and rat CLASP amino acid sequences by ClustalW. Notable protein 
motifs are indicated (see FIG. 1 Legend for additional details). Additionally, the exon/intron 
borders described in part A are indicated with vertical lines between appropriate amino acids. 
Reference numbers are indicted in the right margin and correspond to references in Fig 6A 
and B. 

Figure 7. Southern hybridization analysis of CLASP-2. Genomic DNA from 
HeLa cells or a BAC DNA clone was digested with EcoRI or HinDIQ (genomic DNA) or Pst 
I (BAC DNA) and eletrophoresed and transferred to nylon membrane by standard methods. 
For a probe, a CLASP-2-specific DNA fragment was generated by PCR from a CLASP-2 
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cDNA clone (HC2-5'), using primers HC2AS2 and HC2SL The fragment was labeled by 
incorporation of radioactive 32 P dCTP. Probe HC2.1 is 800 bp long and it recognizes two 
fragments (—4.5 kb and 1.85 kb) on Eco Rl digested genomic DNA. Three fragments are 
revealed by this probe when hybridized to digested DNA of BACs 4 and 6, with the two 
5 major ones identical in size to those detected on genomic DNA. 

Figure 8. Expression of human CLASP- 1 (hCLASP-1) CLASP- 1 and 
CLASP-2 Glulathion-S-Transferase (GST) fusion proteins. Nucleotides encoding a portion 
of the hCLASP-2A intracellular domain (nucleotides 3230-4065) were subcloned into pGEX 
vectors (Pharmacia). Recombinant plasmids were transformed into E. coli (strain DH5a), 

10 and transformed strains were grown by standard conditions. While in log phase cells were 
either induced (I) with IPTG (0.1 mM final concentration) or left uninduced (U). After 
several additional hours of growth cells were harvested and soluble protein lysates generated 
by standard methods. Aliquots of the protein lysates were eletrophoresed on SDS-PAGE 
along with molecular mass standards. The gel was stained with Coomassie Blue and shows 

15 that fusion proteins migrated with their predicted molecular masses of 59 and 57 kD for 
hCLASP-1 and hCLASP-2, respectively. 

Figure 9. A. Binding of CLASP-2 C-terminal 20 amino acids to PDZ 
domains. 20 pM biotinylated synthetic peptide corresponding to the C-terminal 20 amino 
acids of CLASP-2 was reacted with the indicated plate bound GST fusion proteins (none = no 

20 GST fusion protein coated onto plate). Error bars indicate standard deviation of duplicate 
measurements. B. Affinity of CLASP 2 — PDZ interactions. Varying concentrations of 
biotinylated CLASP-2 peptide were reacted with plate bound GST alone, GST-DLG1, GST- 
NeDLG, and GST-PSD95 fusion proteins. The binding to GST alone (< 0.1 OD units) was 
subtracted from the binding to the fusion proteins and the remaining signal was divided by 

25 the signal observed upon addition of 30 jiM CLASP-2 peptide to each PDZ domain- 
containing protein (0.4 - 1.0 OD units) and plotted. The plotted data was fit to a saturation 
binding curve, yielding an apparent affinity of 7.5 jxM for NeDLG- CLASP-2 interaction, 2 1 
pM for DLG1- CLASP-2 interaction, and 45 pM for PSD95-CLASP-2 interaction. Data are 
means of duplicate data points, with standard errors between duplicate data points < 10%. C. 

30 Inhibition of CLASP-2 - PDZ binding. 5 pM biotinylated synthetic peptide corresponding to 
the C-terminal 20 amino acids of CLASP-2 was reacted with the indicated, plate-bound PDZ 
domain-containing GST fusion proteins in the presence or absence of 100 pM competitor 
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peptide. CLASP-2 Inhibitor refers to a synthetic peptide composed of the eight G-terminal 
amino acids of CLASP-2. KV1 .3 Inhibitor refers to a synthetic peptide composed of the 1 9 
C-termina! amino acids of KV1.3, a lymphocyte potassium channel. The amino acid 
sequence of the KV1 .3 inhibitor is TTNNNPNSAVNIKKIFTDV. D. Inhibition of KV1 .3 - 
5 PDZ binding. 5 p.M biotinylated synthetic peptide corresponding to the C-terminal 19 amino 
acids of KV1 .3 was reacted with the indicated, plate-bound PDZ-domain containing GST 
fusion proteins in the presence or absence of 100 |xM CLASP-2 Inhibitor (see FIG. 9C 
legend). 

Figure 10. Preliminary nucleotide sequences of CLASP-2 cDNAs. 

Figure 11. A) Full length cDNA sequence and predicted amino acid 
translation of the human CLASP-2 gene. Predicted initiator methionine starts at nucleotide 
+1. Three independent 1st exons (indicated as 1 1 A, 1 IB and 1 1C) splice into the second exon 
starting at nucleotide -101. The sequence appearing in FIG. 1 corresponds to nucleotides 
1884 through 6690 of FIG 1 1 A. B) Differences between the human CLASP-2 cDNA 
isoforms. In addition to the differential first exon usage indicated in A, sequencing multiple, 
independent cDNA products revealed nucleotide polymorphisms (allelic variations) between 
CLASP-2 cDNA isoforms. Additionally, differential exon usage through alternative splicing 
events was discovered. The use of the exon in B leads to a premature stop codon that can 
generate a soluble form of CLASP-2. C. Schematic of human CLASP-2 cDNAs. The top 
line represents nucleotide numbering found in FIG. 1 1 A. Line (i) represents CLASP-2 
cDNA shown in FIG. 1 above; line (ii) represents the full length CLASP-2 isoforms, where 
there are three CLASP-2 full length cDNA isoforms (A + Z,B + Z, and C~+ Z). Each of the 
isofoims uses a unique first exon (A, B, and C) (see FIG. 1 1 A) that splices into the rest of the 
cDNA from exon 2 onwards represented by Z. The portion of the cDNA represented by Z 
itself has alternative splice and nucleotide polymorphisms that are shown in FIG. 2 above. 
Line (iii) represents the additional 5' sequence with a small region of overlao between 
nucleotides 1884 to 2109 in FIG. 1 1 A and nucleotides 1-225 of FIG. 1. 

Figure 12. Sequence of human CLASP-2 exons and intron boundaries. A 
Sequence of human CLASP-2 exons and intron borders. Stretches of noncontigous genomic 
30 sequence from the Human Genome Project (GENBANK entry gi9988160) were aligned 

using the human CLASP-2 cDNA as a template and Sequencher sequence analysis software 
(Gene Codes Corp). 22 exons representing approximately the 5' 20% of the human CLASP- 
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2 cDNA sequence are presented in predicted 5* to 3' order. Exon sequences are underlined 
and are flanked by intron sequence. Nucleotide numbers in parentheses refer to the exon 
sequence within the uniquely-generated, contiguous gi9988160 sequence, which is located in 
B. B. Ordered stretch of human genomic DNA at the CLASP-2 locus aligned from 
5 noncontiguous, shotgun sequencing from the Human Genome Project using the human 
CLASP-2 sequence from FIG. 5A to determine genomic DNA fragment order and 
orientation. 

Figure 13. Amino acid alignment and comparison between the human (h) 
CLASP family members. Amino acid sequences were aligned using ClustalW. The 

10 alignment is presented in order of their greatest pairwise similarity scores. Single letter 
amino acid abbreviations are used. Astericks indicate complete identity, while colons and 
periods indicate sequence similarity among CLASP family members. Dashes indicate gaps 
inserted in the amino acid sequence to facilitate alignment. Labelled boxes are domains with 
similarity to known protein motifs; unlabelled boxes represent regions of similarity between 

1 5 all CLASPs and may represent CLASP-specific domains. 

Figure 14. Expression of CLASP-2 upon T-cell activation as assayed by 
Northern analysis. Jurkat cells were activated using PMA, Ionomycin, and aCD28. RNA was 
prepared from cell culture aliquots at 0 , 1, 2, 4, 8, 14 hours post activation and Northern 
analysis was performed (A). Hybridization signals obtained with a CLASP-2-specific probe 
20 were quantified using a phosphor imager system. Relative signal intensities (refers to total 

signal of the specific probe used) are shown in the bar diagram (B). The ethidium staining of 
the Northern gel (A) demonstrates even RNA loading. 

DETAILED DESCRIPTION 

5.0 Definitions 

25 Except when noted, the terms "patient" or "subject" are used interchangeably 

and refer to mammals such as human patients and non-human primates, as well as 
experimental animals such as rabbits, rats, and mice, and other animals. 

The term "biological sample" as used herein is a sample of biological tissue, 
fluid, or cells that contains hCLASP-2 or nucleic acid encoding hCLASP-2 protein. Such 

30 samples include, but are not limited to, tissue isolated from humans. Biological samples may 
also include sections of tissues such as frozen sections taken for histologic purposes. A 
biological sample is typically obtained from a eukaryotic organism, preferably eukaryotes 
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such as fungi, plants, insects, protozoa, birds, fish, reptiles, and preferably a mammal such as 
rat, mice, cow, dog, guinea pig, or rabbit, and most preferably a primate such as chimpanzees 
or humans. 

The term "treating" includes the administration of the compounds or agents of 
5 the present invention to prevent or delay the onset of the symptoms, complications, or 

biochemical indicia of a disease, alleviating the symptoms or arresting or inhibiting further 
development of the disease, condition, or disorder (e.g., autoimmune disease). Treatment 
may be prophylactic (to prevent or delay the onset of the disease, or to prevent the 
manifestation of clinical or subclinical symptoms thereof) or therapeutic suppression or 
10 alleviation of symptoms after the manifestation of the disease. 

The term "lymphocyte" as used herein has the normal meaning in the art, and 
refers to any of the mononuclear, nonphagocytic leukocytes, found in the blood, lymph, and 
lymphoid tissues, i.e., B and T lymphocytes. 

The terms "isolated," or "purified," refer to material that is substantially free 
1 5 from components that normally accompany it as found in its native state (e.g., recombinantly 
produced or purified away from other cell components with which it is naturally associated). 
Purity and homogeneity are typically determined using analytical chemistry techniques such 
as polyacrylamide gel electrophoresis or high performance liquid chromatography. The term 
"purified" denotes that a nucleic acid or protein gives rise to essentially one band in an 
20 electrophoretic gel. Particularly, it means that the nucleic acid or protein is at least 85% pure, 
more preferably at least 95% pure, and most preferably at least 99% pure. 

The terms "nucleic acid" and "polynucleotide" are used interchangeably" and 
refer to refers to DNA, RNA and nucleic acid polymers containing known nucleotide analogs 
or modified backbone residues or linkages, which are synthetic, naturally occurring, and non- 
25 naturally occurring, which have similar binding properties as the reference nucleic acid, and 
which are metabolized in a manner similar to the reference nucleotides. Examples of such 
analogs include, without limitation, phosphorothioates, phosphoramidates, methyl 
phosphonates, chiral-methyl phosphonates, 2-O-methyl ribonucleotides, peptide-nucleic acids 
(PNAs). 

30 The terms "polypeptide," "peptide" and "protein" are used interchangeably 

herein to refer to a polymer of amino acid residues. The amino acids may be natural amino 
acids, or include an artificial chemical mimetic of a corresponding naturally occurring amino 
acid. 
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As used herein a "nucleic acid probe" is defined as a nucleic acidxapable of 
specifically binding to a target nucleic acid of complementary sequence (e.g., through 
complementary base pairing). As used herein, a probe may include natural (i.e., A, G, C, or 
T) or modified bases (7-deazaguanosine, inosine, and the like). In addition, the bases in a 
5 probe may be joined by a linkage other than a phosphodiester bond, so long as it does not 
interfere with hybridization (e.g., probes may be peptide nucleic acids). The probes can be 
directly labeled as with isotopes, chromophores, lumiphores, chromogens, or indirectly 
labeled such as with biotin to which a streptavidin complex may later bind. 

The term "recombinant" when used with reference, e.g., to a cell, or nucleic 

10 acid, protein, or vector, indicates that the cell, nucleic acid, protein or vector, has been 

modified by the introduction of a heterologous nucleic acid or protein or the alteration of a 
native nucleic acid or protein, on in the case of cells, to .progeny of a cell so modified. Thus, 
for example, recombinant cells express genes that are not found within the native (non- 
recombinant) form of the cell or express native genes that are otherwise abnormally 

15 expressed, under expressed or not expressed at all. 

The term "heterologous" when used with reference to portions of a nucleic 
acid indicates that the nucleic acid comprises two or more subsequences that are not found in 
the same relationship to each other in nature. For instance, the nucleic acid is typically 
recombinantly produced, having two or more sequences from unrelated genes arranged to 

20 make a new functional nucleic acid, e.g., a promoter from one source and a coding region 
from another source. Similarly, a heterologous protein indicates that the protein comprises 
two or more subsequences that are not found in the same relationship to each other in nature 
(e.g., a fusion protein). 

The term "sequence identity" refers to a measure of similarity between amino 

25 acid or nucleotide sequences, and can be measured using methods known in the art, such as 
those described below: 

The terms "identical" or percent "identity," in the context of two or more 
nucleic acids or polypeptide sequences, refer to two or more sequences or subsequences that 
are the same or have a specified percentage of amino acid residues or nucleotides that are the 

30 same (z.e., 60% identity, preferably 65%, 70%, 75%, 80%, 85%, 90%, or 95% identity over a 
specified region (see, e.g., SEQ ED NO: 1 ), when compared and aligned for maximum 
correspondence over a comparison window, or designated region as measured using one of 
the following sequence comparison algorithms or by manual alignment and visual inspection. 
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The phrase "substantially identical," in the context of two nucleicacids or 
polypeptides, refers to two or more sequences or subsequences that have at least of at least 
60%, often at least 70%, preferably at least 80%, most preferably at least 90% or at least 95% 
nucleotide or amino acid residue identity, when compared and aligned for maximum 
correspondence, as measured using one of the following sequence comparison algorithms or 
by visual inspection. Preferably, the substantial identity exists over a region of the sequences 
that is at least about 50 bases or residues in length, more preferably over a region of at least 
about 100 bases or residues, and most preferably the sequences are substantially identical 
over at least about 150 bases or residues. In a most preferred embodiment, the sequences are 
substantially identical over the entire length of the coding regions. 

The phrase "sequence similarity" in the context of two nucleic acids or 
polypeptides, refers to two or more sequences that are identitical or in the case of amino 
acids, have homologous amino acid substitutions at either 50%, often at least 60%, often at 
least 70%, preferably at least 80%, most preferably at least 90% or at least 95% of the 
indicated positions. 

For sequence comparison, typically one sequence acts as a reference sequence, 
to which test sequences are compared. When using a sequence comparison algorithm, test 
and reference sequences are entered into a computer, subsequence coordinates are designated, 
if necessary, and sequence algorithm program parameters are designated. Default program 
parameters can be used, or alternative parameters can be designated. The sequence 
comparison algorithm then calculates the percent sequence identities for the test sequences 
relative to the reference sequence, based on the program parameters. For sequence 
comparison of nucleic acids and proteins to CLASP-2 nucleic acids and proteins, the BLAST 
and BLAST 2.0 algorithms and the default parameters discussed below are used. 

A "comparison window", as used herein, includes reference to a segment of 
any one of the number of contiguous positions selected from the group consisting of from 20 
to 600, usually about 50 to about 200, more usually about 100 to about 150 in which a 
sequence may be compared to a reference sequence of the same number of contiguous 
positions after the two sequences are optimally aligned. Methods of alignment of sequences 
for comparison are well-known in the art. Optimal alignment of sequences for comparison 
can be conducted, e.g., by the local homology algorithm of Smith & Waterman, 1981 , Adv. 
Appl. Math. 2: 482), by the homology alignment algorithm of Needleman & Wunsch, 1970, 
J. Mol. Biol. 48: 443, by the search for similarity method of Pearson & Lipman, 1988, Proc. 
Natl. Acad. Sci. U.S.A. 85: 2444, by computerized implementations of these algorithms 
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(FASTDB (Intelligenetics), BLAST (National Center for Biomedical Information!, GAP, 
BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics 
Computer Group, 575 Science Dr., Madison, WI), or by manual alignment and visual 
inspection (see, e.g.. Ausubel et al. y 1987 (1999 Suppl.), Current Protocols in Molecular 
5 Biology, Greene Publishing Associates and Wiley Interscience, N.Y.) 

A preferred example of an algorithm that is suitable for determining percent 
sequence identity and sequence similarity is the FASTA algorithm, which is described in 
Pearson, W.R. & Lipman, D.J., 1988, Proc. Natl. Acad. Sci. U.S.A. 85: 2444. See also W. 
R. Pearson, 1996, Methods Enzymol. 266: 227-258. Preferred parameters used in a FASTA 
10 alignment of DNA sequences to calculate percent identity are optimized, BL50 Matrix 15: -5, 
k-tuple= 2; joining penalty= 40, optimization= 28; gap penalty -12, gap length penalty =-2; 
and width= 16. 

Another preferred example of algorithm that is suitable for determining 
percent sequence identity and sequence similarity are the BLAST and BLAST 2.0 algorithms, 

15 which are described in Altschul et al, 1977, Nuc. Acids Res. 25: 3389-3402 and Altschul et 
al y 1990, J. Mol. Biol. 215: 403-410, respectively. BLAST and BLAST 2.0 are used, with 
the parameters described herein, to determine percent sequence identity for the nucleic acids 
and proteins of the invention. Software for performing BLAST analyses is publicly available 
through the National Center for Biotechnology Information (http: //www.ncbi.nlm.nih.gov/). 

20 This algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying 
short words of length W in the query sequence, which either match or satisfy some positive- 
valued threshold score T when aligned with a word of the same length in a database 
sequence. T is referred to as the neighborhood word score threshold (Altschul et aL> supra). 
These initial neighborhood word hits act as seeds for initiating searches to find longer HSPs 

25 containing them. The word hits are extended in both directions along each sequence for as 
far as the cumulative alignment score can be increased. Cumulative scores are calculated 
using, for nucleotide sequences, the parameters M (reward score for a pair of matching 
residues; always > 0) and N (penalty score for mismatching residues; always < 0). For amino 
acid sequences, a scoring matrix is used to calculate the cumulative score. Extension of the 

30 word hits in each direction are halted when: the cumulative alignment score falls off by the 
quantity X from its maximum achieved value; the cumulative score goes to zero or below, 
due to the accumulation of one or more negative-scoring residue alignments; or the end of 
either sequence is reached. The BLAST algorithm parameters W, T, and X determine the 
sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) 
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uses as defaults a wordlength (W) of 1 1, an expectation (E) of 10, M=^5, N— 4 and a 
comparison of both strands. For amino acid sequences, the BLASTP program uses as 
defaults a wordlength of 3, and expectation (E) of 10, and the BLOSUM62 scoring matrix 
(see Henikoff & Henikoff, 1989, Proc. Natl. Acad. Sci. U.S.A. 89: 10915) alignments (B) of 
5 50, expectation (E) of 10, M=5, N=-4, and a comparison of both strands. 

The BLAST algorithm also performs a statistical analysis of the similarity 
between two sequences (see, e.g., Karlin & Altschul, 1993, Proc. Natl. Acad. Sci. U.S.A. 90: 
5873-5787). One measure of similarity provided by the BLAST algorithm is the smallest sum 
probability (P(N)), which provides an indication of the probability by which a match between 
1 0 two nucleotide or amino acid sequences would occur by chance. For example, a nucleic acid 
is considered similar to a reference sequence if the smallest sum probability in a comparison 
of the test nucleic acid to the reference nucleic acid is less than about 0.2, more preferably 
less than about 0.01 , and most preferably less than about 0.001 . 

Another example of a useful algorithm is PILEUP. PILEUP creates a multiple 
15 sequence alignment from a group of related sequences using progressive, pairwise alignments 
to show relationship and percent sequence identity. It also plots a tree or dendogram showing 
the clustering relationships used to create the alignment. PILEUP uses a simplification of the 
progressive alignment method of Feng & Doolittle, 1987, J. Mol. Evol. 35: 351-360. The 
method used is similar to the method described by Higgins & Sharp, 1989, CABIOS 5:151- 
20 153. The program can align up to 300 sequences, each of a maximum length of 5,000 
nucleotides or amino acids. The multiple alignment procedure begins with the pairwise 
alignment of the two most similar sequences, producing a cluster of two aligned sequences. 
This cluster is then aligned to the next most related sequence or cluster of aligned sequences. 
Two clusters of sequences are aligned by a simple extension of the pairwise alignment of two 
25 individual sequences. The final alignment is achieved by a series of progressive, pairwise 
alignments. The program is run by designating specific sequences and their amino acid or 
nucleotide coordinates for regions of sequence comparison and by designating the program 
parameters. Using PILEUP, a reference sequence is compared to other test sequences to 
determine the percent sequence identity relationship using the following parameters: default 
30 gap weight (3.00), default gap length weight (0.10), and weighted end gaps. PILEUP can be 
obtained from the GCG sequence analysis software package, e.g., version 7.0 (Devereaux et 
al., 1984, Nuc. Acids Res. 12: 387-395. 

Another preferred example of an algorithm that is suitable for multiple DNA 
and amino acid sequence alignments is the CLUSTALW program (Thompson, J. D. et al y 
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1994, Nucl. Acids. Res. 22: 4673-4680). ClustalW performs multiple pairwise comparisons 
between groups of sequences and assembles them into a multiple alignment based on 
homology. Gap open and Gap extension penalties were 10 and 0.05 respectively. For amino 
acid alignments, the BLOSUM algorithm can be used as a protein weight matrix (Henikoff 
5 and Henikoff, 1992, Proc. Natl. Acad. Sci. U.S.A. 89: 10915-10919). 

A "label" is a composition detectable by spectroscopic, photochemical, 
biochemical, immunochemical, or chemical means. For example, useful labels include 32 P, 
fluorescent dyes, electron-dense reagents, enzymes (e.g., as commonly used in an ELISA), 
biotin, digoxigenin, or haptens and proteins for which antisera or monoclonal antibodies are 
10 available (e.g., the polypeptide of SEQ ID NO: 1 can be made detectable, e.g., by 

incorporating a radiolabel into the peptide, and used to detect antibodies specifically reactive 
with the peptide). 

The term "sorting" in the context of cells as used herein to refers to both 
physical sorting of the cells, as can be accomplished using, e.g., a fluorescence activated cell 
15 sorter, as well as to analysis of cells based on expression of cell surface markers, e.g., FACS 
analysis in the absence of sorting. 

The phrase "selectively (or specifically) hybridizes to" refers to the binding, 
duplexing, or hybridizing of a molecule only to a particular nucleotide sequence under 
stringent hybridization conditions when that sequence is present in a complex mixture (e.g., 
20 total cellular or library DNA or RNA). 

The phrase "specifically (or selectively) binds" to an antibody refers to a 
binding reaction that is determinative of the presence of the protein in a heterogeneous 
population of proteins and other biologies. Thus, under designated immunoassay conditions, 
the specified antibodies bind to a particular protein at least two times the background and do 
25 not substantially bind in a significant amount to other proteins present in the sample. 

The phrase "specifically bind(s)" or "bind(s) specifically" when referring to a 
peptide refers to a peptide molecule which has intermediate or high binding affinity, 
exclusively or predominately, to a target molecule. The phrases "specifically binds to" refers 
to a binding reaction which is determinative of the presence of a target protein in the presence 
30 of a heterogeneous population of proteins and other biologies. Thus, under designated assay 
conditions, the specified binding moieties bind preferentially to a particular target protein and 
do not bind in a significant amount to other components present in a test sample. Specific 
binding to a target protein under such conditions may require a binding moiety that is 
selected for its specificity for a particular target antigen. A variety of assay formats may be 
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used to select ligands that are specifically reactive with a particular protein. Fo^example, 
solid-phase ELISA immunoassays, immurioprecipitation, Biacore and Western blot are used 
to identify peptides that specifically react with PDZ domain-containing proteins. Typically a 
specific or selective reaction will be at least twice background signal or noise and more 
typically more than 10 times background. Specific binding between a monovalent peptide 
and a PDZ-containing protein means a binding affinity of at least 10 4 M'\ and preferably 10 5 
orl0 6 M _I . 

The phrase "homotypic interaction" refers to the binding of a given protein to 
another molecule of the same protein {e.g., the binding of hCLASP-2 to hCLASP-2). The 
phrase "heterotypic interaction" refers to the binding of a given protein to a different protein 
or other molecule {e.g., the binding of hCLASP-2 to a PDZ domain-containing protein or the 
binding of a transcription factor to DNA). 

The phrase "immune cell response" refers to the response of immune system 
cells to external or internal stimuli {e.g., antigen, cytokines, chemokines, and other cells) 
producing biochemical changes in the immune cells that result in immune cell migration, 
killing of target cells, phagocytosis, production of antibodies, other soluble effectors of the 
immune response, and the like. 

The terms "B lymphocyte response" and "B lymphocyte activity" are used 
interchangeably to refer to the component of immune response carried out by B lymphocytes 
(i.e. the proliferation and maturation of B lymphocytes, the binding of antigen to cell surface 
immunogobulin, the internalization of antigen and presentation of that antigen via MHC 
molecules to T lymphocytes, and the synthesis and secretion of antibodies). 

The terms "T lymphocyte response" and "T lymphocyte activity" are used 
here interchangeably to refer to the component of immune response dependent on T 
lymphocytes (Le. 9 the proliferation and/or differentiation of T lymphocytes into helper, 
cytotoxic killer, or suppressor T lymphocytes, the provision of signals by helper T 
lymphocytes to B lymphocytes that cause or prevent antibody production, the killing of 
specific target cells by cytotoxic T lymphocytes, and the release of soluble factors such as 
cytokines that modulate the function of other immune cells). 

The term "immune response" refers to the concerted action of lymphocytes, 
antigen presenting cells, phagocytic cells, granulocytes, and soluble macromolecules 
produced by the above cells or the liver (including antibodies, cytokines, and complement) 
that results in selective damage to, destruction of, or elimination from the human body of 
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invading pathogens, cells or tissues infected with pathogens, cancerous cells, or,in cases of 
autoimmunity or pathological inflammation, normal human cells or tissues. 

Components of an immune response may be detected in vitro by various 
methods that are well known to those of ordinary skill in the art. For example, (1) cytotoxic 
5 T lymphocytes can be incubated with radioactively labeled target cells and the lysis of these 
target cells detected by the release of radioactivity, (2) helper T lymphocytes can be 
incubated with antigens and antigen presenting cells and the synthesis and secretion of 
cytokines measured by standard methods (Windhagen A; et al., 1995, Immunity 2(4): 373- 
80), (3) antigen presenting cells can be incubated with whole protein antigen and the 

10 presentation of that antigen on MHC detected by either T lymphocyte activation assays or 

biophysical methods (Harding et al. y 1989,, Proc. Natl. Acad. Sci., 86: 4230-4), (4) mast cells 
can be incubated with reagents that cross-link their Fc-epsilon receptors and histamine release 
measured by enzyme immunoassay (Siraganian, et aL 9 1983, TIPS 4: 432-437). 

Similarly, products of an immune response in either a model organism (e.g., 

15 mouse) or a human patient can also be detected by various methods that are well known to 

those of ordinary skill in the art. For example, (1) the production of antibodies in response to 
vaccination can be readily detected by standard methods currently used in clinical 
laboratories, e.g., an EL1SA; (2) the migration of immune cells to sites of inflammation can 
be detected by scratching the surface of skin and placing a sterile container to capture the 

20 migrating cells over scratch site (Peters et al. 9 1988, Blood 72: 1310-5); (3) the proliferation 
of peripheral blood mononuclear cells in response to mitogens or mixed lymphocyte reaction 
can be measured using 3 H-thymidine; (4) the phagocitic capacity of granulocytes, 
macrophages, and other phagocytes in PBMCs can be measured by placing: PMBCs in wells 
together with labeled particles (Peters et al. 9 1988); and (5) the differentation of immune 

25 system cells can be measured by labeling PBMCs with antibodies to CD molecules such as 
CD4 and CD8 and measuring the fraction of the PBMCs expressing these markers. 

As used herein, the phrase "signal transduction pathway" or "signal 
transduction event" refers to at least one biochemical reaction, but more commonly a series 
of biochemical reactions, which result from interaction of a cell with a stimulatory compound 

30 or agent. Thus, the interaction of a stimulatory compound with a cell generates a "signal" that 
is transmitted through the signal transduction pathway, ultimately resulting in a cellular 
response, e.g., an immune response described above. 

A signal transduction pathway refers to the biochemical relationship between a 
variety of signal transduction molecules that play a role in the transmission of a signal from 
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one portion of a cell to another portion of a cell. Signal transduction molecules of the present 
invention include, for example, extracellular and intracellular domains of CLASP-2. As used 
herein, the phrase "cell surface receptor" includes molecules and complexes of molecules 
capable of receiving a signal and the transmission of such a signal across the plasma 
5 membrane of a cell. An example of a "cell surface receptor" of the present invention is the T 
cell receptor (TCR). As used herein, the phrase "intracellular signal transduction molecule" 
includes those molecules or complexes of molecules involved in transmitting a signal from 
the plasma membrane of a cell through the cytoplasm of the cell, and in some instances, into 
the cell's nucleus. In the present invention, CLASP-2 can be referred to as an "intracellular 
10 signal transduction molecule", but can also be referred to as a "signal transduction molecule". 

A signal transduction pathway in a cell can be initiated by interaction of a cell 
with a stimulator that is inside or outside of the cell. If an exterior (i.e., outside of the cell) 
stimulator (e.g., an MHC-antigen complex on an antigen presenting cell) interacts with a cell 
surface receptor (e.g., a T cell receptor), a signal transduction pathway can transmit a signal 
1 5 across the cell's membrane, through the cytoplasm of the cell, and in some instances into the 
nucleus. If an interior (e.g., inside the cell) stimulator interacts with an intracellular signal 
transduction molecule, a signal transduction pathway can result in transmission of a signal 
through the cell's cytoplasm, and in some instances into the cell's nucleus. 

Signal transduction can occur through, e.g., the phosphorylation of a molecule; 
20 non-covalent allosteric interactions; complexing of molecules; the conformational change of 
a molecule; calcium release; inositol phosphate production; proteolytic cleavage; cyclic 
nucleotide production and diacylglyceride production. Typically, signal transduction occurs 
through phosphorylating a signal transduction molecule. According to the present invention, 
a CLASP-2 signal transduction pathway refers generally to a pathway in which CLASP-2 
25 protein regulates a pathway that includes engaged-receptors, PKC-substrates, G proteins, and 
other molecules. 

5.1. Introduction 

The present invention relates to a novel transmembrane protein, CLASP-2, a 
new member of the CLASP family that contains an endodomain that displays the appropriate 
30 properties to organize the cytoskeleton and signal transduction apparatus of the immune 
gateway. 

CLASP-2 functions in cells of the immune system, e.g., T cells and B cells, as 
well as non-immune cells. The CLASP-2 protein functions in a variety of cellular processes, 
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particularly related to immune function, regulation of T cell and B cell interactions, T cell 
activation, and in the organization, establishment and maintenance of the "immunological 
synapse" (see Dustin et aL % 1999, Science 283: 680-682; Paul et al. y 1994, Cell 76: 241-251 ; 
Dustin et a/., 1996, J. Immunol. 157: 2014; Dustin et al. y 1998, Cell 94: 667), including 
signal transduction, cytoskeletal interactions, and membrane organization. 

Without intending to be bound by a particular mechanism or limited in any 
way, the CLASP-2 protein is belie'ved to be a component of the lymphocyte organelle called 

* 

the "immune gateway" that creates a docking site or portal for cell-cell contact during 
antigen-presentation. It is believed the cytoplasmic domains of CLASP-2 proteins organize it 
into a patch at the leading edge of T cells. The carboxy-terminus encoded sequences mediate 
interaction with PDZ domain proteins and with cytoskeletal proteins (e.g., spectrin or 
ankyrin) to connect CLASP-2 to the microtubule network and hold the receptors at a 
polarized configuration just above the microtubule-organizing center ("MTOC")- Thus, 
when T cells engages a B cell acting as an APC, the CLASP-2 molecules engage one another 
to dock the two cells and organize the immune synapse. 

Modulating the expression of the CLASP-2 protein, and interference with, or 
enhancement of, CLASP-2 protein interactions with other proteins has a number of beneficial 
physiological effects, e.g., altered signaling in response to antigen, altered T and B cell 
response to antigen, and modulation of T cell activation. In one aspect, the CLASP-2 
extracellular domain is targeted (e.g., using anti-CLASP-2 antibody, soluble CLASP-2 
fragments, and the like) to regulate T cell activation (and thus regulate immune responses). 
Disorders that can be treated by disrupting CLASP-2 function, include without limitation, 
multiple sclerosis, juvenile diabetes, rheumatoid arthritis, pemphigus, pemphigoid, 
epidermolysis bullosa acquista, lupus, endometriosis, toxemia or pregnancy induced 
hypertension, pruritic urticarial papules and plaques of pregnancy (PUPPP), herpes 
gestationis, impetigo herpetiformis, pruritus gravidarum, placenta-related disorders, and Rh 
incompatibility. 

In another aspect, the present invention provides methods and reagents for 
detection of CLASP-2 expression and CLASP-2-expressing cells. Abnormal expression 
patterns or expression levels are diagnostic for immune and other disorders. For example, 
diseases characterized by overproduction or depletion of lymphocytes in blood or other 
organs may be detected or monitored by monitoring the level of CLASP-2 polypeptide or 
mRNA in a biological sample (e.g., peripheral blood), e.g., the number or percentage of 
CLASP-2 expressing cells. Diseases characterized by overproduction of T cells include, e.g., 
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leukemia (both ALL and CLL), lymphoma (including non-Hodgkins lymphoma, Burkitt's 
lymphoma, mycosis fungoides, and sezary syndrome), EBV, CMV, toxoplasmosis, syphilis, 
typhoid, brucellosis, tuberculosis, influenza, hepatitis, serum sickness, and thyrotoxicosis. 
Diseases associated with the depletion of T cells include, e.g., HIV and myelodysplasia. 
5 Diseases associated with the overproduction of B cells include, e.g., leukemia (both ALL and 
CLL), non-Hodgkins lymphoma, Burkitt's lymphoma, myeloma, EBV, CMV, toxoplasmosis, 
syphilis, typhoid, brucellosis, tuberculosis, influenze, hepatitis, serum sickness, and 
thyrotoxicosis Diseases associated with the depletion of B cells include, e.g., 
myelodysplasia. 

10 5.2. CLASr-2 cDNA and Polypeptide Structure 

The CLASP-2 protein is type I transmembrane glycoprotein, characterized by 
multiple forms produced by alternative exon usage {i.e., production of splice variants). In 
one naturally occurring form, CLASP-2 has the structure shown in FIG. 1. However, as 
discussed in detail infra, the CLASP-2 gene encodes a variety of gene product due to 
15 alternative splicing of mRNA. FIG. 2 shows the nucleotide sequence and conceptual 
translation of human CLASP-2 polypeptides: 

hCLASP-2A cDNA (SEQ ED NO: 1) and hCLASP-2A polypeptide (SEQ ID 

NO: 2). 

hCLASP-2B cDNA (SEQ ID NO: 3) and hCLASP-2B polypeptide (SEQ ID 

20 NO: 4). 

hCLASP-2C cDNA (SEQ ED NO: 5) and hCLASP-2C polypeptide (SEQ ID 

NO: 6). 

hCLASP-2D cDNA (SEQ ID NO: 7) and hCLASP-2D polypeptide (SEQ ID 

NO: 8). 

25 hCLASP-2E cDNA (SEQ ID NO: 9) and hCLASP-2E polypeptide (SEQ ID 

NO: 10). 

Unless specifically referred to, the phrase "human CLASP-2 (hCLASP-2)" is 
used hereinrefers to hCLASP-2A, hCLASP-2B, hCLASP-2C and hCLASP-2E. "hCLASP- 
2D" cDNA is also known as KIAA1058, which was described by Kikuno et al, 1999, DNA 
30 Res. 6, 197-205 as a cDNA from brain encoding a protein of unknown function. 

CLASP-2 polypeptides typically include an approximately 120 residue leader 
sequence, followed by a cadherin proteolytic cleavage signal RXXR, an extracellular domain, 
a transmembrane domain, and an intracellular domain. The present invention provides a 
polynucleotide having the sequence of SEQ. ID. NO: 1, or a fragment thereof, and a 
35 polypeptide having the sequence of SEQ. ED NO: 2, or a fragment thereof. In addition, the 
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invention provides polynucleotides comprising hCLASP-2 genomic sequences, CLASP-2 
homologs from other species, naturally occurring alleles of hCLASP-2, and hCLASP-2 
variants as described herein, and methods for using CLASP-2 polynucleotide, polypeptides, 
antibodies and other reagents. 

5 5.2.1. CLASP-2 Polypeptide Domains 

As is shown in FIG. 1 , one naturally occurring CLASP-2 cDNA encodes a 
polypeptide characterized by several structural and functional domains and defined sequence » 
motifs. To provide guidance to the practitioner, the structural features are described infra. 
However, it will be understood that the present invention is not limited to polypeptides that 
1 0 include all, or any particular one of these domains or motifs. For example, a CLASP-2 fusion 

orotein of the invention contains onlv the extracellular domain of CLASP-2. Similarly, the 

■ ■ -• , 

CLASP-2 A polypeptide of SEQ ID NO: 2 does not have the IT AM motifs (discussed infra) 
found in the CLASP-2B and 2C polypeptides. 

It will be appreciated that the structurally (and functionally) different domains 

15 of CLASP-2 polypeptides (and the corresponding region of the mRNA) are of interest, in 

part, because they may be separately targeted or modified {e.g., deleted or mutated) to affect 
the activity or expression of a CLASP-2 gene product (in order to, for example, modulate an 
immune response). For example, the extracellular domain of a CLASP-2 protein can be 
targeted {e.g., using an anti-CLASP monoclonal antibody to (a) block the interaction of a 

20 CLASP-2-expressing cell {e.g., a T cell) and a second cell {e.g., a B cell) displaying a protein 
that is bound by CLASP-2 {i.e., a CLASP-2 ligand). Similarly, an intracellular domain {e.g., 
ITAM or DOCK, see infra) can be targeted to interfere with signal transduction without 
interfering with extracellular ligand binding. 

Generally, inhibiting CLASP-2 expression or CLASP-2 polypeptide function 

25 will result in modulation of immune function including, for example, changing the threshold 
for T cell activation by affecting formation of the immune synapse. Modulation of immune 
function can be screened and quantitated by a number of assays known in the art and 
described herein (see also §5.14). 

5.2.1.1. Signal Peptide 

30 The human CLASP-2 sequence presented in FIG. 1 encodes two potential start 

sites for translation. The first predicted methionine appears at nucleotide 278 (ATG). The 
second methionine appears at nucleotide 476. Both have an acceptable consensus sequence 
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for a translational start (A/GxxATGG; Kozak, M., 1996, Mamm. Genome 7(8): 563-74 ). A 
polypeptide beginning at the second methionine is also predicted to encode a signal peptide 
capable of localizing the protein to the secretory pathway by SignalP, a signal sequence 
prediction program (Nielsen, H. ei a/. y 1997, Protein Eng. 10(1): 1-6). Polypeptides 
5 beginning at the first methionine are not predicted to contain a signal sequence; however, the 
consensus for signal prediction is only 80-90% accurate for known signal sequences. A third 
possibility for a translational start is that the cDNA listed in FIG. 1 is incomplete and another 
methionine is encoded in frame and upstream of the sequence shown in FIG.l . 

5.2.1.2. Extracellular Domain 

1 0 The CLASP-2 extracellular domain is characterized by one cadherin EC-like 

motif (Pigott, R. and Power, C, 1993, The Adhesion Molecule Factbook. Academic Press, 
pg. 6; Jackson, R. M. and Russell, R. B., 2000, J. Mol. BioL 296: 325-34). Several highly 
conserved cysteines are found in the extracellular domain, as well as various glycosylation 
signals. Through its extracellular domains, CLASP-2 may interact with ligands in a 

1 5 homotypic and/or heterotypic manner to establish the immunological synapse in conjunction 
with molecules such as TCR, MHC class I, MHC class II, CD3 complex and accessory 
molecules such as CD4, CD3, ICAM-1, LFA-1, and others. Many cadherins contain a pro- 
domain of approximately 50 to 150 amino acids that is removed before localization to the 
plasma membrane. This cleavage is presumed to be carried out by Furin (Posthaus, H. et al. 9 

20 1998, FEBS Let 438: 306-10) at a consensus sequence of RKQR. Furin is a protease that is at 
least partially responsible for the maturation of certain cadherins. CLASP-2 has the sequence 
RNQR at nucleotides 945 through 957. By homology, this region is around 120 amino acids 
into the predicted protein start site for hCLASP-2A. This region may be a pro-domain and 
cleavage may be required for CLASP-2 function, or aspects of CLASP-2 function. 

25 Antibodies raised against the extracellular domain can be added to cells 

expressing CLASP-2. These antibodies can either block the interaction of CLASP-2 with 
potential ligands or stabilize these interactions. Any immunoassay known in the art, e.g., 
listed and described herein, may be used to assess the modulation of immune function 
brought about by this approach. 

30 Similarly, portions of the extracellular domain of CLASP-2 can be expressed 

as soluble protein. This soluble protein can then be added to cells expressing CLASP-2. 
These proteins may interact with potential ligands to competitively inhibit their binding to 
endogenous CLASP-2. This could modulate CLASP-2 function via the immunoassays 
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described herein. Recombinant proteins could interfere in a positive or negative fashion with 
CLASP-2 interactions. 

5.2.1.3. Transmembrane Domain 

CLASP-2 predicted amino acid sequence was analyzed using the PHDhtm 
5 analysis software for prediction of transmembrane helices (Rost, B., et aL % 1996, Prot. 

Science 7: 1704-1 718). Using the PPHDhtm analysis software, it was determined that the 
transmembrane domain is located from nucleotides 2861-2917 (see FIG. 1), as well as three 
other potential transmembrane domains located near the amino terminal end. 

5.2.1.4. Intracellular Domains 

10 The CLASP-2 intracellular domains contain motifs corresponding to several 

types of protein domains. Depending on the specific CLASP-2 (i.e., specific family member 
or splice variant) all or only some of the domains can be present. Listed from amino terminus 
to carboxy terminus, the domains include: (1) IT AM (Chan et al 1994, Annual Review of 
Immunology 12: 555-592), (2) a newly discovered DOCK/CLASP-2 motif, (3) a coiled-coil 

15 motif, and (4) a C-terminal PDZ binding motif (PBM) (also referred to as PDZ ligand or 

"PL"). 

5.2.1.5. ITAM 

Immunoreceptor Tyrosine-based Activation Motifs (ITAM motifs; also known 
as ARAM, or antigen recognition activation motifs) are motifs contained within antigen 

20 receptors for T and B cells, and Fc receptors on other leukocytes, and are necessary for 
proper activation and signal transduction in these cells. They are characterized by the 
consensus sequence YXXL/I - X7/8- YXXL/I (Grucza et al, 1999, Biochemistry 38: 5024- 
5033), usually separated by 6-8 amino acids (Watson et at., 1998, Immunol. Today 19: 260- 
264; Isakov, J. Leukoc. Biol. 61: 6-16). ITAM is used as an intracellular regulatory motif 

25 through its ability to be tyrosine phosphorylated by src-family tyrosine kinases such as Lyn 
' that are involved in leukocyte signal transduction. Once phosphorylated, the ITAM acts as a 
high affinity binding site for SH2 containing proteins. Signal transduction components 
including ZAP-70, Syk, Lyn, She, PI3 kinase, and Grb2 contain SH2 domains and have been 
shown to bind ITAMs (Clements e^/., 1999, Aiuiu. Rev. Immunol. 17:89-108). This 

30 places ITAM-containing molecules in a central role of intracellular signal regulation in 
leukocytes. ITAM motifs in leukocyte signaling can facilitate signal transduction (e.g. ', 
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10 


15 


20 


tyrosine kinase signaling) by acting as temporal scaffolds where other transduction 
components could bind and be properly positioned to mediate transduction. IT AM motifs 
often appear in multiples in a protein, however, it is known that one set of YXXL/I alone can 
transduce signals of the PTK pathway, though weakly. 

CLASP-2 proteins typically have ITAM YXXL/I motifs (where X is any 
amino acid) separated by 3 or 13 amino acids. In various embodiments the CLASP-2 
polypeptide of the invention is characterized by one or more of the motifs shown in Table 1. 

Table 1 
CI .ASP-2 ITAM Motifs 


25 


MotifNo. 

Sequence Motif 

1 

YXX(I/L)-X 3 -YXX(I/L) 

2 

3 1 

YXX(I/L)-Xn-YXX(I/L) — 

YXX(I/L)-X 3 -YXX(I/L)-X 13 -YXX(I/L) 


30 


The presence of multiple ITAM motifs in CLASPs proteins indicates that they 
may be engaged by multiple signal transduction components (e.g., ZAP-70/Syk, She, PI3 
kinase, and Grb2). In general, the ITAM motif in CLASP proteins match identically to the 
canonical ITAM motif with some motifs containing a conservative amino acid change (i.e. 
valine instead of isoleucine or leucine). As previously described for other ITAMs, the 
ITAMs within CLASPs can bind SH2-containing proteins including ZAP-70, Syk, She, PI3 
kinase, and Grb2. Since CLASPs have an extracellular domain, CLASPs protein can 
independently initiate a signal transduction cascade through engagement of its extracellular 
domain. Otherwise CLASPs may cooperate with an antigen receptor signaling complex (e.g., 
with CD3/TCR, BCR, FcR), to facilitate tyrosine kinase signal transduction 

The ITAMs have demonstrated different binding specificity and affinities for 
SH2 domains (Clements, et al, 1999, Ann. Rev. Immunol. 17: 89-108). For example, She, 
PI3 kinase~and Grb2 bind to dual and mono phosphorylated ITAMs with different affinities. 
Thus the ITAMs inCLASPs are believed to provide quantitative as well as qualitative 
differences in signal transduction depending up their phosphorylation state, as well as to 
inhibit or augment specific protein interactions and hence specific tyrosine kinase-mediated 

signaling pathways in leukocytes. 

Antagonizing the PTK-CLASP-2 interaction (e.g., phosphorylation of 
CLASP-2) will thus inhibit immune function. In one embodiment, interactions between 
ITAM-bearing human CLASPs and their binding partners are believed to be antagonized by 
the alpha subtype (SIRPalpha) of signal regulatory proteins that has been shown to negatively 
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regulate IT AM -dependent lymphocyte activation (Lienard H; 1999, J Biol Chem^74: 32493- 
9). Also, a recently recognized family of immunoreceptor tyrosine-based inhibition motif 
(ITIM) receptors are thought to inhibit the ITAM-induced activation of immune competent 
cells (Gergely, et a/., 1999, J. Immunol Lett 68: 3-15) and therefore may block CLASP- 
5 partner interaction. 

5.2.1.6. DOCK 

CLASP-2 polypeptides contain a new "DOCK" motif, not previously 
described in the scientific literature. The CLASP DOCK motif includes a series of five 
tyrosines surrounded by conserved sequences in regions A, B, C, D, and G (see FIG. 5B). 
10 There are also two highly conserved non-tyrosine containing regions (E and G) separated by 
nine amino acids (P+EXAI+XM) and (LXMXL+GXVXXXVNXG) (where X is any amino 
acid). 

The cytoplasmic region of CLASP-2 immediately following the IT AM 
domains exhibits sequence similarity to the C-terminal third of the so-called "DOCK" 

15 proteins. The DOCK gene family includes three molecules that are the human homologues 
of the C. elegans CED proteins known to be involved in apoptosis. CED-5 (DOCK180), a 
major CRK-binding protein, alters cell morphology upon translocation to the membrane ( 
mediates the membrane motion that scavenger cells exhibit as they surround and engulf dying 
cells; its function can be partially rescued by the human DOCK180 (Wu et aL, 1998, Nature 

20 392: 501-504). Myoblast City in Drosophila (MBC) is another member of the DOCK protein 
family and has been found to be involved in myoblast fusion (Erickson, et aL 9 1997, J. Cell 
Biol. 138: 589). Since CLASP-2 expression is found in syncytial tissues sufch as placenta, 
muscle, and heart, it is believed that CLASP-2 is involved in mediating or inhibiting cell 
fusion. 

25 The DOCK family has been implicated in the control of cell shape. DOCK1 , 

when transfected into spindle cells, can make them flattened and polygonal (Takai, et aL 9 
1996, Genomics 35: 403-303). DOCK1 expression is ubiquitous except in hematopoetic 
cells. DOCK2 is expressed in hematopoetic cells and when transfected into spindle cells can 
make them round up (Nishihara, H., 1999, Hokkaido Igaku Zasshi 74: 157-66). DOCK2 is 

30 expressed in peripheral blood lymphocytes, thymus, spleen, and liver. 
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5.2.1. 7. COILED-COIL 

CLASP-2s have the two coiled-coil domains (Lupas et ai 9 1991, Science 252: 
1 162-64; Lupas, A., 1996, Meth. Enzymology 266: 513-525). Coiled-coil domains are 
known to interact directly with cytoskeleton, indicating that that CLASP-2 proteins interact 
5 directly with the cytoskeleton. Thus, it is believed that CLASP-2 binds cytoskeletal proteins, 
e.g., spectrin, ankyrin, hsp70, talin, ezrin, tropomyosin, myosin, plectin, syndecans, 
paralemmin, Band 3 protein, Cytoskeletal protein 4. 1, Tyrosine phosphatase PTP36 and other 
molecules. 

5.2.1.8. PDZ Ligand 

10 CLASP-2 proteins contain a PDZ-ligand motif ("PBM" or "PL") at the C- 

terminus of the protein. This short (3-8 amino acid) motif mediates the binding of proteins 
terminating at their carboxyl terminus in the motif (most commonly S/T - X - V - free 
carboxyl-terminus) to other proteins containing one or more specific PDZ domains (See 
Songyang et al. 9 1997, Science 275: 72 and Doyle et aL 9 1996, Cell 85: 1067 for a discussion 

1 5 of PDZ-ligand structures). 

PDZ domain-containing proteins are involved in the organization of ion 
channels and receptors at the neurological synapse and in establishing and maintaining 
polarity in epithelial cells via their binding to the C-termini of transmembrane receptors. It 
has been shown that PDZ-domain containing proteins can mediate protein-protein 

20 interactions in immune system cells (e.g., DLG1 binds to the lymphocyte potassium channel 
KV1.3 in human T lymphocytes, (Hanada et al 9 1997, J. Biol. Chem. 272: 26899). 

Biochemical evidence that CLASP-2 interacts with the PDZ domains of three 
closely related proteins is shown in FIG 9A-D. FIG. 9A demonstrates the specificity of the 
interaction r as the C-terminal 20 amino acids of CLASP-2 bind PSD-95, NeDLG, and DLG1, 

25 but not to the PDZ domains of the TIAM-1 protein. FIG. 9B demonstrates the affinity of the 
interaction. Notably, the highest affinity interaction occurs between CLASP-2 and NeDLG, 
with a specific binding affinity of at least 1 0 4 M" 1 . Affinities in the micromolar range have 
been found for other biologically important PDZ-ligand interactions. FIG. 9C demonstrates 
the ability to inhibit CLASP-2 PDZ interactions using either a short fragment of CLASP-2 

30 (the eight C-terminal amino acids) or the C-terminus of KV1.3. As noted above, KV1.3 is 
known to bind to DLG1 in live lymphocytes. FIG. 9D demonstrates that CLASP-2 and 
KV1 .3 compete for PDZ binding; z.e., not only does KV1 .3 block CLASP-2 binding but 
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CLASP-2 also blocks KV1 .3 binding. The ability of the eight C-terminal residues of 
CLASP-2 to inhibit the interaction of both CLASP-2 and KV1.3 with selected PDZ domains 
suggests that compounds related to the C-terminal eight-amino acids of CLASP-2, when 
introduced into cells, will mediate changes in multiple protein-protein interactions involved 
5 in the function of lymphoid tissues and other tissues that express these proteins (including 
heart, lung, and kidney). 

Evidence that the C-terminal 8 amino acids of CLASP-2, when introduced into 
cells, can effect cellular function comes from the experiments in which these amino acids 
were introduced into cells as a fusion, e.g., with the HIV-derived TAT transporter peptide 

10 sequence. Addition of the TAT-CLASP-2 fusion peptide to Jurkat T lymphocytes (compared 
to controls using the TAT peptide alone) results in subtle, time-dependent alterations in 
intracellular calcium concentrations as measured using the calcium indicator dye Fluo-4. 
While these results are consistent with the hypothesis that the TAT-CLASP-2 fusion changes 
T cell ion fluxes. In particular, the results indicate that the CLASP-2 C-terminal sequence 

15 can slightly increase basal intracellular calcium concentrations and can slightly decrease the 
proportional increase in calcium upon activation of the cells with anti-CD3 antibody. Such 
changes would be expected for a compound that disrupts localization of the T cell activation- 
associated CLASP-2 protein and the KV1 .3 potassium channel. Small changes in T cell 
calcium flux can result in large changes in the functional activity of the cells (Wulfing et aL, 

20 1997, J. Exp. Med. 185: 1815). 

5.2.1. 9. Modulation of Immune Responses 

CLASP-2 proteins, as described above, modulate immune function in a variety 
of ways and through a variety of mechanisms (i.e., changing the threshold for T cell 
activation) by affecting formation of the immunological synapse. Establishment and 
25 maintenance of the immunological synapse can involve: (A) signal transduction, (B) cell-cell 
interactions, and (C) membrane organization. 

(A) Signal transduction 

Human CLASP proteins, as discussed above, contain SH3 domains and 
tyrosine phosphorylation sites. These regions have been shown to be involved in signal 
30 transduction in a variety of cells including lymphocytes. Thus, human CLASP proteins are 
believed to interact with these regions during signal transduction events which lead to 
modulation of immune responses. 
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CLASP proteins can interact with Tec sub-family of nonreceptortyrosine 
kinases. The Tec sub-family of nonreceptor tyrosine kinases consists of Tec, Btk, 
Tsk/Itk/Emt ltk, and Bmx, and is defined by the presence of SH3 and SH2 domains adjacent 
to the catalytic domain and an amino-terminal region containing a pleckstrin homology (PH) 
5 domain, a Tec homology (TH) domain, and a proline-rich region (Mano, H.; 1 999, Cytokine 
Growth Factor Rev 10: 267-80). The T cell specific Tsk/ltk/Emt, and Btk expressed in most 
hematopoietic cells other than T cells are important components of antigen receptor signaling 

pathways in hematopoietic cells. 

Btk has been identified as the gene defective in murine X-linked 
1 0 immunodeficiency (xid) and human X-linked agammaglobulinemia (XLA) (Nisitani, S., 
2000, Proc Natl Acad Sci U.S.A. 97: 2737-42). In xid mice, B cell numbers are reduced to 
one-half of normal and the titers of specific immunoglobulin isotypes are significantly 
reduced; in addition, xid B cells are insensitive to a number of mitogenic stimuli. The human 
disorder is much more severe, resulting in nearly complete elimination of the B cell 
15 compartment and dramatically reduced immunoglobulin levels. Biochemical studies have 
supported multiple roles for Btk in B cell activation. Btk kinase activity and tyrosine 
phosphorylation are increased after cross-linking either the B cell receptor on B cells or the 
high affinity IgE receptor, FcRI, on mast cells. Interleukin-5 and interleukin-6 treatment 
have also been shown to lead to the activation of Btk. 
20 ltk, like Btk, is tyrosine-phosphorylated upon antigen receptor cross-linking 

(Mano, H., 1999, Cytokine Growth Factor Rev, 10: 267-80). In addition, peripheral T cells 
from mice lacking functional ltk are refractory to stimulation by antibodies to CD3 plus 
antigen presenting cells. These Itk-deficient T cells can be stimulated by phorbol ester and 
calcium ionophore, demonstrating that ltk acts in signaling pathways proximal to the TCR. 
25 Unlike the related Src family tyrosine kinases including Lyn, Lck, Fyn, ZAP- 

70, SyK, and CSK, the Tec family kinases lack the amino-terminal myristylation site crucial 
for the membrane localization of Src family kinases, suggesting that some adaptor proteins 
are required for the their membrane localization (Mano, H., 1999, Cytokine Growth Factor 
Rev 10: 267-80). Since all the Tec family kinases contain a proline-rich region which could 
30 be bound by a SH3 domain, and since all the human CLASPs contain a SH3 domain, it is 
believed that human CLASPs could serve as adaptors for the members in the Tec family in 

different hematopoietic cells. 

GTP-binding proteins play an important role in immune response (Mach, B., 
1999, Science 285: 1367). A number of biochemical events triggered by TCR/CD3-induced 
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T cell activation are ablated by agents that modulate the action of G proteins. Pertinent to this 
is the ability of cholera toxin to inhibit the cellular proliferation and intracellular Ca 
mobilization that is mediated by anti-CD3 antibody treatment of T cells. The G protein 
competitive inhibitor GDPS, can impede the extent of inositol phosphates generated upon 
5 stimulation in peripheral T lymphocytes. Nonhydrolyzable analogs of GTP, such as GTPS, or 
other agents such as ALF that activate G proteins by circumventing the need for receptor 
engagement, can result in T cell activation. 

The Guq/1 1 subfamily (Stanners, J., 1995, J Biol Chem 270: 30635-42) and 
Rapl (Lafont. V, I99S, Biochem Pharmacol 55: 319-24 ) of GTP-binding proteins have 

10 been shown to be involved in human T cell receptor/CD3-mediated signal transduction 

pathway. Also. Cdc42, a Rho family small GTPase, is known to play a critical role in the 
formation of actin microspikes in response to external stimuli (Miki- HL; 1998 7 Nature, 391 : 
93-6). Interestingly, a Cdc42 binding protein, WASP, has a proline-rich domain which could 
interact with the SH3 domain present in all the human CLASPs. Human CLASPs may 

1 5 interact with these GTP-binding proteins. 

Several adaptor proteins including NCK, CBL (Bachmaier, K., 2000 Nature 
403: 211-6), SHC LNK, SLP-76, HS1, SIT, VAV, GrB2, and BRDG1, and two tyrosine 
phosphotases, EZRIN, SHP-1 and SHP-2 have been shown to interact with ITAM or SH3 
domains. These proteins may also interact with CLASP-2. Several proteins have been shown 

20 to interact with ITAM or SH3 domains and may also interact with CLASP-2. These include 
adaptor proteins such as NCK, CBL (Bachmaier, K., 2000, Nature 403: 21 1-6), SHC, LAT, 
LNK, SLP-76 (Krause M et al, 2000, J Cell Biol 149: 181-94), HS1, SIT, VAV, GrB2 
(Zhang W. and Samclson, L.E., 2000, Semin Immunol 12: 35-41), and BRDG1, kinases such 
as SYK and LCK, and tyrosine phosphatases such as SHP-1 and SHP-2. These interactions 

* 

25 can be defined by a number of different biochemical or cell biological methods including in 
vitro binding assays, co-immunoprecipitation assays, co-immunostaining (Harlow, E. and 
Lane, D., 1999, Using Antibodies: A laboratory Manual. Cold Spring Harbor Press) or 
genetic assays such as yeast the yeast two hybrid system, in which a CLASP-2 protein or 
fragment can be used as "bait" (Zervos et al, 1993, Cell 72: 223-232; Madura et al, 1993, J. 

30 Biol. Chem 268: 12046-12054). 

Other assays include in vitro binding assays, co-immunoprecipitation assays, 
co-immunostaining assays, and yeast two hybrid system screening assays in which a CLASP- 
2 domain or fragment can be used as "bait" or 'trap" protein (Zervos et al (1993), Cell 72: 
223-232; Madura et al (1993) J. Biol. Chem. 268: 12046-12054). 
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In other embodiments, CLASP polypeptides are transfected into lymphocytes. 
After transfection, a variety of standard assays can be used to evaluate, for example, CLASP 
modulation of T cell activation. These assays include calcium influx assays, NF-AT nuclear 
translocation assays (e.g.. Cell, 1998, 93: 851-61), NF-AT/luciferase reporter assays (e.g., 
5 MCB 1 996 16:7151-71 60), tyrosine phosphorylation of early response proteins such as HS I , 
PLC-y, ZAP-76, and Vav (e.g., J. Biol. Chem. 1997, 272: 14562-14570). 

(B) Cell-Cell Interaction 

As discussed above, human CLASP proteins are homologues of E-cadherin. 
As shown in FIG. 1, CLASP-2 contains both a cadherin cleavage domain and a cadherin 
1 0 ectodomain. Therefore CLASP-2 proteins may interact with cadherins through these 
domains. The cadherins constitute a family of cell surface adhesion molecules that are 
involved in calcium-dependent cell to cell adhesion. Human cadherins, E-, P- N- and VE- 
cadherin, have a restricted tissue distribution: E- and P-cadherin are expressed in epithelial 
tissues, N-cadherin is found mainly on neural cells, and VE-cadherin is found on vascular 
1 5 endothelium. Homophilic binding between cadherins on adjacent cells is vital for the 
maintenance of strong cell to cell adhesion in these tissues. For example E-cadherin is 
required for the formation of adherens junctions between mature epithelial cells and is 
involved in Langerhans cell adhesion to keratinocytes, and VE-cadherin is needed for the 
maintenance of lateral association between endothelial cells. The extracellular regions of 
20 mature mammalian cadherins are comprised of five "CAD" modules of approximately 1110 
amino acids. Crystallographic and biochemical studies indicate that cadherins can form 
dimers on the cell surface, and that interaction with dimeric cadherins on opposing cell 
surfaces can lead to the formation of "zipper-like" cell junctions. 

The integrins are a second family of transmembrane adhesion molecules that 
25 are involved in both cell to cell and cell to matrix interactions. At least 15 chains associate 
with 8 chains to form a large number of heterodimeric integrins that can be classified into 
several major subfamilies based on their shared use of a particular chain. Members of three 
subfamilies, the 1 , 2, and 7 integrins, are commonly found on leukocytes. The expression of 
1 integrins is widespread (for example, 51, CD49e/CD29, is found on T cells, granulocytes, 
30 platelets, fibroblasts, endothelium, and epithelium), whereas the 2 and 7 integrins have a . 
restricted pattern of expression. 

Interestingly, E-cadherin on human epithelial cells has been found to be a 
ligand for the mucosal lymphocyte integrin, E7, and a similar interaction has been indicated 
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in the mouse. Monoclonal antibodies to E-cadherin or to E7 block IEL adherence to 
epithelial cells, and transfection of cells with E7 confers upon them the ability to adhere to 
cells transfected with E-cadherin, 

L929 cells can be transfected with CLASP-2 and Neomycin. G418-resistant 
5 clones can be screened for CLASP-expression with anti-CLASP peptide-specific antibodies. 
CLASP-expressing clones can be used to test for homotypic and/or heterotypic calcium 
dependent cell adhesion using the "cell aggregation assay" described for cadherin molecules 
(Murphy-Erdosh, C. et a/., 1995, J. Cell Biol. 129: 1379-1390). 

Several approaches can be used to identify the amino acids involved in the 

10 binding domains. Soluble fusion molecules (e.g., EC12-IgG, ECC-lgG, ECM-IgG, and GST- 
EC^), peptides, and peptide-specific anti-CLASP antibodies are available for blocking 
experiments in the above-described assay. Transfectants generated by site-directed 
mutagenesis can also be used. 

(C) Membrane Anchoring/Cytoskeletal Interactions 

15 Interestingly, tyrosine-phosphorylated ITAMs interact with actin cytoskeleton 

upon activation of mature T lymphocytes (Rozdzial, M. M., 1995, Immunity 3: 623-633). 
Since human CLASPs contain both ITAMs and coiled-coil domains which have been shown 
to interact with cytoskeletal proteins, CLASPs are believed to play an important role in 
modulating cell surface molecule expression by re-organizing cytoskeletal structure. 

20 F-actin microfilament cytoskeletal organization has been known to be 

involved in the modulation of cell surface molecule expression. WASP, a GTPase-binding 
protein, plays a critical role in the formation of actin microgpikes in response to external 
stimuli and ectopic expression of WASP induces the formation of F-actin filament clusters 
that overlap with the expressed WASP itself. Another WASP family protein, N-WASP, has 

25 also been shown to play important roles in filopodium formation. Both of these proteins 
cause actin polymerization, but with different features when they are expressed in cells; 
WASP mainly localizes at perinuclear areas and causes actin clustering, but most N-WASP is 
present at plasma membranes and induces filopodium formation (Miki, H.; 1998, Nature 391: 
93-6). Both WASP and N-WASP, contain a proline-rich domain which could interact with 

30 the SH3 domain present in all the human CLASPs. CLASP-2 may interact with F-actin 
filament through CLASP-2 binding to WASP or WASP-like proteins. 

Standard assays can be used for detecting CLASP protein interaction with 
cytoskeletal proteins. These assays include co-sedimentation assays, far western blot analysis 
(Ohba, T., 1998, Anal. Biochem. 262: 185-192), surface pasman resonance, F-actin staining 
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with phalloidin in CLASP-transfected lymphocytes (e.g., Small, J. et al. 1999, Microsc. Res. 
Tech. 4: 3-17), and immunocytal analysis of subcellular distribution of focal adhesion 
proteins (such as paxillin, tensin, vinculin, talin, and FAK in CLASP-transfected 
lymphocytes; see. e.g.. Ridyard, M.S., 1998, Biochem. Cell Biol. 76: 45-58). 

5 5.2.2. CLASP-2 Exon Structure and Genomic Domains 

Alternative splicing is likely to represent a regulatory switch that governs 
different functions of CLASP-2 in immune responses Additionally, alternative splice 
variants affecting the untranslated regions of an RNA can be a way of regulating RNA 
stability. 

1 0 As noted supra, CLASP-2 gene expression is characterized by alternative 

exon usage. Introa'cxon structure can be predicted by computer analysis of genomic DNA, 
however, splice junctions and alternative splicing can only be elucidated by comparison of 
genomic clones to cDNA clones. Alternative splicing and RNA editing are mechanisms 
generate a variety of proteins from the same gene. An example for how alternative splicing is 
1 5 used to generate thousands of different proteins from only a few genes is represented by the 
Neurexin gene family (for review of Neurexins, see Missler M. and Suedhof, T., 1998, 
Trends in Genetics, 14: 20-25). Comparative analysis of CLASP-2 genomic clones and 
cDNA clones revealed that CLASP-2 is composed of numerous exons and that distinct 
CLASP-2 transcripts are generated by alternative splicing. The protein encoding portion of 
20 CLASP-2 is covered by at least 14 exons (FIG. 6A). 

Numerous diseases are caused or are thought to be caused by splice site 
mutations that can cause exon skipping or otherwise result in a truncated protein product 
Some of these diseases include, e.g., Marfan Syndrome (Liu W, et al., 1997, Nat. Genet 16: 
328-9), Hunter disease (Bonucelli G, et al., 2000, Hum. Mutat. (Online) 2000 15(4): 389, 
25 Duchenne muscular dystrophy (Wibawa T, et al., 2000, Brain Dev. 22(2): 107-1 12), 
Myelomonocytic leukemia (Wutz D, et al., 1999, Leuk. Lymphoma 35: 491-9.), and 
Isovaleric acidemia (Vockley J, et al., 2000, Am. J. Hum. Genet. 66: 356-67). This is 
especially true for genes composed of many exons (such as CLASP-2). The genomic 
sequence around CLASP-2 exon/intron boundaries is useful for diagnostic approaches 
30 towards the identification of diseases caused by splice site mutations. The abundance or 

presence of CLASP-2 isoforms in cell populations (e.g., hematopoietic cells, lymphocytes) is 
correlated with a disease state by comparing the abundance of CLASP-2 in cells from 
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subjects suffering from the disease with the level of CLASP-2 in cells from healthy subjects. 
This can be accomplished by utilizing any number of assays (e.g., PCR). 

Alignment of the CLASP-2 intron/exon splice sites with the CLASP-2 protein 
sequence and the finding of conserved exon/intron boundaries within the CLASP gene family 
5 (FIG. 6) suggest that specific CLASP-2 exons encode functionally distinct protein domains 
(sec FIG. 6 and Example 4). ITAM and DOCK motifs 1 and 2 are encompassed by splice 
sites (amino acid residues 946 and 1063); DOCK motif 3 and COILED-COIL motif 1 and 2 
arc also encompassed by splice sites (amino acid residues 1 102, 1 170 and 1246, 
respectively). 

10 CLASP-2 alternative transcripts are summarized in FIG. 3 and FIG.l IB. 

Briefly, one alternative exon missing in CLASP-2A is present in CLASP-2B and CLASP-2D. 
This exon contains the DNA portion encoding the ITA-M motif and 

CLASP-2D protein product does not contain the C-terminal 38 amino acids of CLASP-2A 
and CLASP-2B: Thus, a PDZ binding motif (SSVV; amino acid residue 1286 through 1289) 

15 that is only present in the CLASP-2A/B-specific C-terminal end is missing in the CLASP-2D 
gene product. The presence or absence of this PDZ binding motif can be attributed to 
alternative RNA processing. Additionally, a CLASP-2 alternative transcript has been found 
that deletes nucleotides 209-291 that results in a premature stop codon. The protein encoded 
by this transcript appears to be a soluble form of CLASP-2 that may regulate (e.g., is an 

20 antagonist or an agonist) the function other CLASP family members and isoforms. 

5.23. CLASP Superfamilv Members 

As is illustrated in FIG. 5, CLASP-2 is a member of a superfamily of immune- 
cell associated proteins with similar motifs. CLASP-1 was described in U.S.S.N. 09/41 1,328, 
filed October 1, 1999. CLASP-1 uniquely among the known CLASPs contains SH3 binding 
25 domain motifs. CLASP-2A, -B, -C, and -E polypeptides have no adaptor binding sites or SH3 
binding domains found in CLASP-1. CLASP-3, CLASP-4, CLASP-5 and CLASP-7 are 
described in copending U.S.S.N. 60/182,296, filed February 14, 2000, and which is 
incorporated by reference herein in its entirely for all purposes. 

5.3. CLASP-2 mRNA Expression 

30 As described in Example 2, CLASP-2 mRNA expression was assayed in 

tissues and cell lines by Northern analysis. The results are shown in FIG. 4A and B. The 
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results of Northern Analysis of CLASP-2 expression and expression of other members of the 
CLASP family are summarized in Table 2. 


Table 2 


Tissue/Cell Line 1 


CLASP 


1 


PBL 


Lung 


Placenta 


-/+ 


Sm Intestine 


-/+ 


Liver 


Kidney 


-/+ 


Spleen 


Thymus 


-/+ 


+ 


Colon 


Skel Muscle 


Heart 


Brain 


Jurkat 


-H- 


MV411 


THP1 


HL60 


9D10 


3A9 


CH27 


293 


-/+ 


-/+ 


-/+ 


+++ 


-/+ 


+++ 


■*-++ 


-/+ 


-/+ 


-/+ 


-/+ 


-/+ 


-/+ 


-/+ 


-/+ 


-/+ 


-/+ 


-/+ 


1. Jurkat = human T cell line; MV4-11 = B myelomonocyte; 9D10 = B cell 
5 line; THP-1 = monocyte; 3A9 = mouse T cell; CH27 = mouse B cell line; 

HL60 = human promyelocyte; 293 = embryonic kidney epithelial cells (293) 

2. Table Legend (based on Northern blot results): - - no expression; -/+ = low 
expression; + ^^edium expression; ++ medium high expression; +++ high 
expression. 

10 3. A CLASP-2 EST (EST 815795) was identified from a bone marrow cDNA 

library. 

4. The probe used (HC2.2) did not distinguish between CLASP-2 A, -2B, -2C 
and 2D.. This probe encompasses nucleotides 3920 to 4650 (731 bp long) 
from CLASP-2A cDNA. 

15 5. In RNA from 9D10, the major transcript runs substantially shorter than the 

major transcripts seen in Jurkat and 293 cells; however, the longer transcript is 
also present in 9D10. Hybridization of probe HC2.2 with 9D10 total RNA 
reveals at least 3 different transcripts. See FIG. 4B 

As indicated in Table 2 and shown in FIG. 4, CLASP-2 is expressed most 
20 strongly in placenta followed by lung, kidney and heart; CLASP-3 is expressed strongly in 
kidney and heart, and less strongly in placenta and skeletal muscle ; CLASP-4 is expressed 
exclusively in peripheral blood lymphocytes; CLASP-5 is expressed strongly in peripheral 
blood leukocytes, present in placenta, kidney, spleen and thymus, and weakly in lung, small 
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intestine and liver. It is not expressed in brain, heart, skeletal muscle and large intestine; 
CLASP-7 is expressed strongly in lung, heart, liver and kidney, but not in PBL, brain or 
thymus. 

Differences in tissue expression patterns for different CLASP proteins indicate 
5 different CLASPs have differential roles in immune function and, accordingly, can be 
separately targeted to achieve different functions. For example, since CLASP proteins are 
necessary for proper function or signaling by the T cell receptor (TCR), the tissue specific 
distribution of different CLASPs permits differential modulation of the immune response in 
different tissues. Since CLASP-2 is present in heart, blocking CLASP-2 function or 

10 expression is useful to selectively block immune response in the heart (for example, to 

selectively stop immune response in the heart compartment, e.g., following cardiac transplant 
rejection or post-MI inflammation, without compromising immunity elsewhere. Similarly, 
blocking CLASP-3 can block rejection of the kidney following kidney transplant. 
Furthermore, by adjusting the level of inhibition, the degree of immune blockage versus 

1 5 response can be modulated in the compartments represented by each CLASP. 

5.4. CLASP-2 Polynucleotides And Methods Of Use 

The present invention provides a variety of CLASP-2 polynucleotides and 
methods for using them. In one aspect, the polynucleotide of the invention encodes a 
polypeptide comprising at least a fragment (e.g., an immunogenic fragment) of a CLASP-2 
20 protein (e.g., at least a fragment of SEQ. ID. NO: 2, 4, 6 or 10) or variant thereof. In another 
aspect, the molecules that comprise a CLASP-2 polynucleotide that, while not necessarily 
encoding a CLASP-2 protein or fragment, is useful as a probe or primer for detecting 
CLASP-2 expression, for inhibition of CLASP-2 expression (e.g., antisense or ribozyme- 
mediated inhibition), for gene knockout, and the like. 

25 5.4.1. CLASP-2 Polynucleotides 

The invention also provides isolated or purified nucleic acids having at least 8 
nucleotides (i.e., a hybridizable portion) of a CLASP-2 sequence or its complement; in other 
embodiments, the nucleic acids consist of at least about 25 (continuous) nucleotides, about 50 
nucleotides, about 100 nucleotides, about 150 nucleotides, about 200 nucleotides, about 250 
30 nucleotides, about 500 nucleotides, about 550 nucleotides, about 600 nucleotides, or about 
650 nucleotides or more of a CLASP-2 sequence, or a full-length CLASP-2 coding sequence. 
In another embodiment, the nucleic acids are smaller than about 35, about 200 or about 500 
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nucleotides in length. Polynucleotides can be single or double stranded, and may be DNA, 
RNA, PNA or a hybrid molecule. 

In specific aspects, nucleic acids are provided which comprise a sequence 
complementary to at least about 10, 25, 50, 100, 150, 200, 250, 500, 550, 600, or 650 
5 nucleotides or the entire coding region of a CLASP-2 coding sequence. Usually, the isolated 
polynucleotide is less than about 100 kbp, generally less than about 50 kbp, and often less 
than about 20 kbp, less than about 10 kbp, less than about 5 kbp, or less than about 1000 
nucleotides in length. 

In a specific embodiment, a nucleic acid that is hybridizable to a CLASP-2 
10 nucleic acid or its complement, or to a nucleic acid encoding a CLASP-2 derivative, under 

conditions of low stringency is provided. Derivatives of CLASP-2 contemplated include, but 
are not limited to, splice variants of a gene encoding a CLASP-2, other members of a 
CLASP-2 gene family which differ from one of the CLASP-2 nucleotide or amino acid 
sequences disclosed herein by the insertion or deletion of one or several domains, and the 
1 5 like. 

In one embodiment, the CLASP-2 polynucleotide is identical or exactly 
complementary to SEQ. ID NO: 1, 3, 5 or 9 or selectively hybridizes to an aforementioned 
sequence. In various embodiments, the polynucleotide is identical or exactly complementary 
to, or selectively hybridizes to, the nucleotide sequence encoding a particular protein domain 

20 or region, or a particular gene exon of the CLASP-2 mRNA or genomic sequence. Such 
polynucleotides are particularly useful as probes, because they can be selected to identify a 
defined species of CLASP-2. 

In addition to the polypeptide and polynucleotide sequences specifically 
exemplified herein, the invention contemplates CLASP-2 homologues from other species, 

25 allelic and splice variants, and other variants disclosed herein. The CLASP-2 gene exhibits 
evidence of alternative splicing of transcripts. 

For example, CLASP-2A and CLASP-2C are related to each other as apparent 
splice variants, with CLASP-2C containing an exon not found in CLASP-2A. The exon 
sequence is 5'-AGG GAT TTT GAG AGG CTG GCC CAT CTG TAT GAC ACG CTG 

30 CAC CGG GCC TAC AGC AAA GTG ACC GAG GTC ATG CAC TCG GGC CGC AGT 
TNC TGG GGA CCT ACT TCC GGG TAG CCT TCT TCG GGC AG-3* (encoding the 
peptide sequence: RDFERL AHLYDTLHRA YSK VTE VMH S GRRLLGT YFR V AFFGQGF) . 
It will be apparent to one of skill that, by using polynucleotide probes or primers 
corresponding to the nucleic acid sequence above, or by using antibodies that specifically 
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recognize the peptide above, or those polynucleotide probes or primers shown inTable 3 
below, it is possible to distinguish between different CLASP isoforms(e.g.. to detect 
differential expression). 


Table 3 



I'ound in/will detect 

Exemplary Probe/Primer (5* - 3*) 

Notes/Comments 

1 

full length hC2A 

l : 1 : CCCAGAmTTATG ATG AG 
Rl: GATAATGACAAAGTTCTGAC 


2 

Full length hC2D . 

F2: CTGGAAATCTTGACAAAAATGC 
R2: GTCT1TTTAATACAGATCTGG 


3 

hC2B, hC2C. hC2E 

F3: GAGAGGCTGGCCCATCTGTATG 
R3: ATCTTCAAAGAATCCCTGCC 

Distinction based upon 
product size differences 
following PCR 

A 
H 


r4. OAAOC-AOTCCAO FGGGAGCCG 
R4: GCCTCCCCGGCTCCTCCTCAGG 

Recognizes hC2D-specific 
insertion 

5 

hC2D 

F3: GAGAGGCTGGCCCATCTGTATG 
R5: CCTCCACATCTGTTTCACTGTC 


6 

hC2E 

F5: CTCCATGATGGAAGACGTGGG 
R6: GATGAGCTCGTAGCGCTCGGC 

Spans deletion unique to 
hC2E. Distinction based 
upon product size differences 
following PCR 

7 

hC2B 

F6: CATTGGCGTTTAAGCTCCTG 
R3: ATCTTCAAAGAATCCCTGCC 

F6 primer spans deletion 
unique to hC2E 

8 

hC2A 

F7: GG ACCC ATAGTTC ATG A TCG 
R4: CTTC ATCTTC A AG A A ATCCCTC 

R4 primer spans the region 
where other CLASPs have an 
insert 


5.4.1.1. Substantial Identity 

In some embodiments, the CLASP-2 polynucleotides of the invention are 
substantially identical to SEQ ID NOs: 1, 3, 5, or 9, or to a fragment thereoT. 

An indication that two nucleic acid sequences are substantially identical is that 
10 the two polynucleotides have a specified percentage sequence identity e.g., usually at least 
about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 95%, or 
at least about 98 identity over a specified region when optimally aligned. 

Another indication that two nucleic acid sequences are substantially identical 
is that a polypeptide encoded by the first nucleic acid is immunologically cross reactive with 
15 the antibodies raised against the polypeptide encoded by the second nucleic acid, as described 
below. Thus, a polypeptide is typically substantially identical to a second polypeptide, for 
example, where the two peptides differ only by conservative substitutions. Another indication 
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that two nucleic acid sequences are substantially identical is that the two molecules or their 
complements hybridize to each other under stringent conditions, as described below. 

Yet another indication that two nucleic acid sequences are substantially 
identical (e.g.. a naturally occurring allele of the CLASP-2 sequence of SEQ ID NO: 1 ) is 
that the same primers can be used to amplify the sequence. For example, CLASP-2 
polynucleotides can be PCR amplified from cDNA derived from human lymphocytes using 
the primer pairs shown in Table 3. 

The primers of Table 3 are also useful for amplification of CLASP-2 splice 
variants. Another indication that two nucleic acid sequences are substantially identical is that 
they selective hybridize under stringent conditions (z\c, one sequence hybridizes to the 
complement of the second sequence), as described infra. 

5.4.1.2. Selective Hybridization 

The invention also relates to nucleic acids that selectively hybridize to 
exemplified CLASP-2 sequences (including hybridizing to the exact complements of these 
sequences). Selective hybridization can occur under conditions of high stringency (also 
called "stringent hybridization conditions'*), moderate stringency, or low stringency. 

5.4.1.2.1. High Stringency 

"Stringent hybridization conditions" are conditions under which a probe will 
hybridize to its target subsequence, typically in a complex mixture of nucleic acid, but not to 
other sequences. Stringent conditions are sequence-dependent and will be different in 
different circumstances. Longer sequences hybridize specifically at higher temperatures. An 
extensive guide to the hybridization of nucleic acids is found in Tijssen, Techniques in 
Biochemistry and Molecular Biology— Hybridization with Nucleic Probes, "Overview of 
principles of hybridization and the strategy of nucleic acid assays" (1993). Generally, 
stringent conditions are selected to be about 5-10°C lower than the thermal melting point (T m ) 
for the specific sequence at a defined ionic strength pH. The T m is the temperature (under 
defined ionic strength, pH, and nucleic concentration) at which 50% of the probes 
complementary to the target hybridize to the target sequence at equilibrium (as the target 
sequences are present in excess, at T m » 50% of the probes are occupied at equilibrium). 
Stringent conditions will be those in which the salt concentration is less than about 1.0 M 
sodium ion, typically about 0.01 to 1 .0 M sodium ion concentration (or other salts) at pH 7.0 
to 8.3 and the temperature is at least about 30°C for short probes (e.g., 10 to 50 nucleotides) 
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and at least about 60°C for long probes (e.g., greater than 50 nucleotides). Stringent 
conditions may also be achieved with the addition of destabilizing agents such as formamide. 
For high stringency hybridization, a positive signal is at least two times background, 
preferably 1 0 times background hybridization. Exemplary high stringency or stringent 
5 hybridization conditions include: 50% formamide, 5x SSC and 1% SDS incubated at 42° C or 
5x SSC and 1 % SDS incubated at 65° C, with a wash in 0.2x SSC and 0.1% SDS at 65° C. In 
a specific embodiment, a nucleic acid which is hybridizable to a CLASP-2 nucleic acid under 
the following conditions of high stringency is provided: Prehybridization of filters containing 
DNA is carried out for 8 h to overnight at 65°C in buffer composed of 6X SSC, 50 mM 

10 Tris-HCl (pH 7.5), 1 mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.02% BSA, and 500 ^g/ml 

denatured salmon sperm DNA. Filters are hybridized for 8-16 h at 65°C in prehybridization 
mixture containing 100 jig/'mi denatured salmon sperm DJNlA and 5-20 X 10° cpm of 
32 P-labeled probe. Washing of filters is done at 65°C for 15-30 h in a solution containing 2X 
SSC, 0.1% SDS. This is followed by a wash in 0.2X SSC and 0.1% at 50°C for 1 5-30 min 

1 5 before autoradiography. 

5.4.1.2.2. Moderate Stringency 

In another specific embodiment, a nucleic acid, which is hybridizable 
to a CLASP-2 nucleic acid under conditions of moderate stringency is provided. Examples 
of procedures using such conditions of moderate stringency are as follows: Filters containing 

20 DNA are pretreated for 6 h at 55°C in a solution containing 6X SSC, 5X Denhart's solution, 
0.5% SDS and 100 |ig/ml denatured salmon sperm DNA. Hybridizations are carried out in 
the same solution and 5-20 X 10 6 cpm 32 P-labeled probe is used. Filters are incubated in 
hybridization mixture for 12-16 h at 55°C, and then washed twice for 30 minutes at 50°C in a 
solution containing IX SSC and 0.1% SDS. Filters are blotted dry and exposed for 

25 autoradiography. Other conditions of moderate stringency which can be used are well-known 
in the art. Washing of filters is done at 45°C for 1 h in a solution containing 0.2X SSC and 
0.1% SDS. 

5.4.1.23. Low Stringency 

By way of example and not limitation, procedures using such conditions of 
30 low stringency are as follows (see also Shilo and Weinberg, 1981, Proc. Natl. Acad. Sci. 
U.S.A. 78: 6789-6792): Filters containing DNA are pretreated for 6 h at 40 C in a solution 
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containing 35% formamide, 5X SSC, 50 mM Tris-HCl (pH 7.5), 5 mM EDTA,S.1% PVP, 
0.1% Ficoll, 1% BSA, and 500 ^ig/ml denatured salmon sperm DNA. Hybridizations are 
carried out in the same solution with the following modifications: 0.02% PVP, 0.02% Ficoll, 
0.2% BSA, 100 g/ml salmon sperm DNA, 10% (wt/vol) dextran sulfate, and 5-20 X 10 6 cpm 
32 P-labeled probe is used. Filters are incubated in hybridization mixture for 1 8-20 h at 40 C, 
and then washed for L5 h at 55 C in a solution containing 2X SSC and 0.1% SDS. The wash 
solution is replaced with fresh solution and incubated an additional 30 minutes at 50-55°C. 
Filters are blotted dry and exposed for autoradiography. If necessary, filters are washed for a 
third time at 60-65°C and reexposed to film. Other conditions of low stringency that can be 
used are well known in the art (e.g., as employed for cross-species hybridizations). 

5.4.1,3. CLASP-2 Variants and Fragments 

The CLASP-2 variants of the invention can contain alterations in the coding 
regions, non-coding regions, or both. Especially preferred are polynucleotide variants 
containing alterations which produce silent substitutions, additions, or deletions, but do not 
alter the properties or activities of the encoded polypeptide. Nucleotide variants produced by 
silent substitutions due to the degeneracy of the genetic code are preferred. CLASP-2 
polynucleotide variants can be produced for a variety of reasons, e.g., to optimize codon 
expression for a particular host (change codons in the human mRNA to those preferred by a 

bacterial host such as E. coli). 

Exemplary CLASP-2 polynucleotide fragments are preferably at least about 
15 nucleotides, and more preferably at least about 20 nucleotides, still more preferably at 
least about 30 nucleotides, and even more preferably, at least about 40 nucleotides in length, 
or larger 50, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650 nucleotides. In one 
embodiment, exemplary fragments include fragments having at least a sequence from about 
nucleotide number 1-50, 51-100, 101-150, 151-200, 201-250, 251-300, 301-350, 351-400, 

401-450, 451-500, 501-550, 551-600 to the end of SEQ ID NO: 1 or SEQ ID NO: or 

comprising the cDNA coding sequence in the deposited clones. In this context "about" 
includes the particularly recited ranges, larger or smaller by several (5, 4, 3, 2, or 1) 
nucleotides, at either terminus or at both termini. Preferably, these fragments encode a 
polypeptide which has biological activity. More preferably, these polynucleotides can be used 
as probes or primers as discussed herein. 

In other embodiments, CLASP-2 polynucleotides of the invention are other 

than SEQ ID NO: 1 or fragments of SEQ ID NO: 1 . 

42 


0231117A2 L> 


WO 02/31 117 


PCT/USO 1/32202 


As shown in FIG 1 1 above, there are at least three CLASP-2 full length cDNA 
isoforms (A + Z, B + Z, and C + Z). Each of the isoforms uses a unique first exon (labelled 
exon 1 A, IB. and 1 C) (see FIG. 1 1 and Table 4 below). 

Table 4: CLASP-2 Isoforms 


CLASP-2 Isoform 

FIG 1 1 C Schematic 

Nucleotides 

Isoform 1 

A + Z 

-182 to 6690 

Isoform 2 

i ■ 

B + Z 

-219 to 6690 

1 so form 3 

C + Z 

-143 to 6690 


5 

In one embodiment, the CLASP-2 polynucleotide has the sequence shown in 
FIG. 1 1 (Isoform 1. Isoform 2. or Isoform 3 as indicated in Table 4 above) or a .fragment of 
the sequence shown in FIG. 1 1 comprising at least about 1,5, 10, 25 or 50 or more 
continguous nucleotides from nucleotides —182 to 1 883 of Isoform 1, nucleotides -219 to 

10 1883 of Isoform 2 f or nucleotides -143 to 1883 of Isoform 3. 

In another embodiment, CLASP-2 primers or probes comprise at least about 5, 
10, 25 or 50 or more continguous nucleotides from nucleotides —182 to 1883 of Isoform 1, 
nucleotides -219 to 1SS3 of Isoform 2, or nucleotides -143 to 1883 of Isoform 3 as shown in 
FIG. 1 1 and Tabic 4 above alone or in combination with SEQ ID NO:l or a fragment of SEQ 

15 EDNO:l. 

In an aspect, the invention provides antibodies or binding fragments that bind 
the CLASP-2 isoforms 1-3. In another embodiment, the invention provides antibodies that 
specifically bind to the CLASP-2 isoforms shown in FIG. 1 1 but not to the polypeptide 
encoded by SEQ ID NO: 1 

20 In one embodiment, the CLASP-2 variants differ from those shown in FIG. 1 

or FIG. 1 1 (SEQ ID NOs 1 , 3, 5, 7 9, ) by virtue of incorporating a different 

combination of exons than found in the exemplified sequences. For example, 81g01 
(Genbank Accession Number AF85864; Locus HUMYN81g01 ; 526 bp; EST sequence 
submitted August 29, 1998 by Washington University at St. Louis; see FIG. 3 A and FIG. 3B) 

25 is a variant of hCLASP-2 on the basis of CLASP-2 identity along certain stretches of the 

sequence. From 5' to 3', it begins with a 315 nucleotide stretch which is identical to CLASP- 
2A. It then has a gap relative to CLASP-2A that is identical to the GAP in another CLASP-2 
isoform, hCLASP-2D (KIAA1058). In place of that gap, a 16 amino acid insert (48 
nucleotides) is present which is not found in other isoforms, followed by another 
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approximately 150 bp stretch of nucleotides once again identical to CLASP-2A~_ This is 
characteristic of an alternate splice due to the specific sequence identity on both sides of a 
differential stretch of nucleotides. 

Using known methods of protein engineering and recombinant DNA 
5 technology, variants can be generated to improve or alter the characteristics of the CLASP-2 
polypeptides. For instance, one or more amino acids can be deleted from the N-terminus or 
C-terminus of the CLASP-2 protein without substantial loss of biological function. 

Furthermore, even if deleting one or more amino acids from the N -terminus or 
C-terminus of a polypeptide results in modification or loss of one or more biological 
10 functions, other biological activities can still be retained. For example, the ability of a 

deletion variant to induce and/or to bind antibodies which recognize the secreted form will 
likely be retained when less than the majority of the residues of the secreted form are 
removed from the N-terminus or C-terminus. Whether a particular polypeptide lacking Nor 
C-terminal residues of a protein retains such immunogenic activities can readily be 
] 5 determined by routine methods described herein and otherwise known in the art. 

Thus, the invention further includes CLASP-2 polypeptide variants which 
show biological activity. Such variants include deletions, insertions, inversions, repeats, and 
substitutions selected according to general rules known in the art so as have little effect on 
activity. For example, guidance concerning how to make phenotypically silent amino acid 
20 substitutions is provided in Bowie, J. U. et al, Science 247: 1306-1310 (1990), wherein the 
authors indicate that there are two main strategies for studying the tolerance of an amino acid 
sequence to change. 

The first strategy exploits the tolerance of amino acid substitutions by natural 
selection during the process of evolution. By comparing amino acid sequences in different 
25 species, conserved amino acids can be identified. These conserved amino acids are likely 

important for protein function. In contrast, the amino acid positions where substitutions have 
been tolerated by natural selection indicates that these positions are not critical for protein 
function. Thus, positions tolerating amino acid substitution could be modified while still 
maintaining biological activity of the protein. 
30 The second strategy uses genetic engineering to introduce amino acid changes 

at 30 specific positions of a cloned gene to identify regions critical for protein function. For 
example., site directed mutagenesis or alanine-scanning mutagenesis (introduction of single 
alanine mutations at every residue in the molecule) can be used. (Cunningham and Wells, 
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1989, Science 244: 1081-1085) The resulting mutant molecules can then be tested-for 
biological activity. 

In various embodiments, CLASP-2 polynucleotide fragments include coding 
regions for, or regions hybridizable to, the CLASP-2 structural or functional domains 
5 described supra. As set out in the Figures, such preferred regions include the following 
domains/motifs: IT AM, DOCK, COILED/COILED, and PBM. Thus, for example, 
polypeptide fragments of CLASP-2 as shown in FIG. 1 and FIG. 1 1-(SEQ ID NO: 2, 4, 6 10, 

) falling within conserved domains are specifically contemplated by the present 

invention (see FIG. 3). Moreover, polynucleotide fragments encoding these domains are also 
10 contemplated. Such polypeptide fragments find use, for example, as inhibitors of CLASP-2 
function in CLASP-2-expressing cells. 

5.4.2. Uses of CLASP-2 Polynucleotides 

The CLASP-2 polynucleotides of the invention are useful in a variety of 
applications. In one aspect of the invention, the polypeptide-encoding CLASP-2 

1 5 polynucleotides of the invention are used to express CLASP-2 polypeptides (e.g., as 

described herein) for example to produce anti-CLASP-antibodies or for use as therapeutic 
polypeptides. In another aspect, the CLASP-2 polynucleotide or fragments thereof can be 
used for diagnostic purposes (e.g., as probes for CLASP-2 expression). In particular, since 
CLASP-2s can be expressed in lymphocytes, a CLASP-2 polynucleotide can be used to 

20 detect the expression of CLASP-2 as a lymphocyte marker. For diagnostic purposes, a 
CLASP-2 polynucleotide can be used to detect CLASP-2 gene expression or aberrant 
CLASP-2 gene expression in disease states. In another aspect, the CLASP-2- polynucleotide 
or fragments are used for therapeutic purposes. For example, included in the scope of the 
invention are methods for inhibiting CLASP-2 expression, e.g., using oligonucleotide 

25 sequences, such as antisense RNA and DNA molecules and ribozymes, that function to 

inhibit expression of CLASP-2. In another aspect, CLASP-2 polynucleotides can be used to 
construct transgenic and knockout animals, e.g., for screening of CLASP-2 agonists and 
antagonists. In another aspect, CLASP-2 polynucleotides can be used for screening of 
CLASP-2 agonists and antagonists. 

30 5.4.2.1. Use of CLASP-2 P lvnucleotides for Detection, Diagnosis, and Treatment 

The CLASP-2 polynucleotides of the invention are useful for detection of 
CLASP-2 expression in cells and in the diagnosis of diseases or disorders (e.g., 
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immunodeficicnt states) resulting from aberrant expression of CLASP-2. Aberrant 
expression of CLASP-2 mRNA or protein means expression in lymphocytes (e.g., T 
lymphocytes or B lymphocytes) or other CLASP-2 expressing cells of at least 2-fold, 
preferably ai least 5-fold greater or less than expression in control lymphocytes obtained from 
a healthy subject. CLASP-2 polypeptide expression is easily measured by EL1SA using anti- 
CLASP-2 antibodies of the invention. CLASP-2 mRNA expression (including expression of 
specific species or splice variants of CLASP-2) can be measured by quantitative Northern 
analysis or quantitative PCR, LCR, or other methods, using the probes and primers of the 
invention. 

In one embodiment, the assays of the present invention are amplification- 
based assays for detection of an CLASP-2 gene product. In an amplification based assay, all 
or part of a CLASP-2 mRNA or cDNA (hereinafter also referred to as "target") is amplified, 
and the amplification product is then detected directly or indirectly. When there is no 
underlying gene product to act as a template, no amplification product is produced (e.g., of 
the expected size), or amplification is non-specific and typically there is no single 
amplification product. In contrast, when the underlying gene or gene product is present, the 
target sequence is ampli fied, providing an indication of the presence and/or quantity of the 
underlying gene or mRNA. Target amplification-based assays are well known to those of 
skill in the art. 

The present invention provides a wide variety of primers and probes for 
detecting CLASP-2 genes and gene products. Such primers and probes are sufficiently 
complementary to the CLASP-2 gene or gene product to hybridize to the target nucleic acid. 
Primers are typically at least 6 bases in length, usually between about 10 and about 100 bases, 
typically between about 12 and about 50 bases, and often between about 14 and about 25 
bases in length, often PCR primers of 15-30 (e.g., 18-22 nucleotides) are used. However, the 
length of primers can be adjusted by one skilled in the art. One of skill, having reviewed the 
present disclosure, will be able, using routine methods, to select primers to amplify all, or any 
portion, of the CLASP-2 gene or gene product, or to distinguish between variant gene 
products, CLASP-2 alleles, and the like. Single oligomers (e.g., U.S. Pat. No. 5,545,522), 
nested sets of oligomers, or even a degenerate pool of oligomers can be employed for 
amplification. 

It will be appreciated that probes and primers can be selected to distinguish 
between species and splice variants based on the guidance of this disclosure, by targeting 
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primers or probes to differentially used exons (or exon-exon junctions that differ between 
variants). 

Methods can include the steps of collecting a sample of cells from a patient, 
isolating nucleic acid (e.g., genomic, mRNA or both) from the cells of the sample, contacting 
5 the nucleic acid sample with one or more primers which specifically hybridize to an CLASP- 
2 gene under conditions such that hybridization and amplification of the CLASP-2-gene (if 
present) occurs, and detecting the presence or absence of an amplification product, or 
delecting the size of the amplification product and comparing the length to a control sample. 
See U.S. Pat. Nos. 4,683,195 and 4,683,202, Landegran et ah y 1988, Science 241: 1077-1080; 

10 Nakazawa et ah, 1994, Proc. Natl. Acad. Sci. U.S.A. 91 : 360-364, Abravaya et ah, 1995, 
Nucleic Acids Res. 23: 675-682). 

Because CLASP-2 gene products are expressed in the immune system (e.g., T 
lymphocytes, B lymphocytes and macrophages), expression will be typically assayed in these 
cells. Methods which are well known to those skilled in the art can be used to isolate 

15 lymphocytes, macrophages, and alike (See, e.g., Coligan, J. E., et ah (eds.), 1991, Current 
Protocols in Immunology, John Wiley & Sons, NY; this reference is incorporated by 
reference for all purposes). In one embodiment, assays are carried out on biopsy or autopsy- 
derived tissue. 

In various embodiments, CLASP-2 gene expression is detected by 
20 hybridization of a detectable probe to mRNA or cDNA obtained from cells (e.g., 

lymphocytes). A variety of methods for specific DNA and RNA measurement using nucleic 
acid hybridization techniques are known to those of skill in the art (see Sambrook et ah, 
supra). Hybridization based assays refer to assays in which a probe nucleia acid is 
hybridized to a target nucleic acid, forming a hybridization complex. Usually the nucleic 
25 acid hybridization probes of the invention are entirely or substantially identical to a 

contiguous sequence of the CLASP-2 gene or RNA sequence. Preferably, nucleic acid 
probes are at least about 50 bases, often at least about 20 bases, and sometimes at least about 
200 bases, at least about 300-500 nucleotides or more in length. Various hybridization 
techniques are well known in the art, and are in fact the basis of many commercially available 
30 diagnostic kits. 

Methods of selecting nucleic acid probe sequences for use in nucleic acid 
hybridization are discussed in Sambrook et ah 9 supra. In some formats, at least one of the 
target and probe is immobilized. The immobilized nucleic acid can be DNA, RNA, or 
another oligo- or poly-nucleotide, and can comprise natural or non-naturally occurring 
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nucleotides, nucleotide analogs, or backbones. Such assays can be in any of several formats 
including: Southern, Northern, dot and slot blots, high-density polynucleotide or 
oligonucleotide arrays (e.g., GeneChips™ Affymetrix), dip sticks, pins, chips, or beads. All 
of these techniques are well known in the art and are the basis of many commercially 
5 available diagnostic kits. Hybridization techniques are generally described in Hames et ai, 
ed., 1985, Nucleic Acid Hybridization, A Practical Approach IRL Press; Gall and Pardue, 
1969, Proc. Natl. Acad. Sci. U.S.A., 63: 378-383; and John et aL y 1969, Nature, 223: 582- 
587. 

A variety of nucleic acid hybridization formats are known to those skilled in 
10 the art. For example, one common format is direct hybridization, in which a target nucleic 
acid is hybridized to a labeled, complementary probe. Typically, labeled nucleic acids are 
used for hybridization, with the label providing the detectable signal. One method for 
evaluating the presence, absence, or quantity of CLASP-2 mRNA is carrying out a Northern 
transfer of RNA from a sample and hybridization of a labeled CLASP-2 specific nucleic acid 
1 5 probe. A useful method for evaluating the presence, absence, or quantity of DNA encoding 
CLASP-2 proteins in a sample involves a Southern transfer of DNA from a sample and 
hybridization of a labeled CLASP-2 specific nucleic acid probe. 

Other common hybridization formats include sandwich assays and 
competition or displacement assays. Sandwich assays are commercially useful hybridization 
20 assays for detecting or isolating nucleic acid sequences. Such assays utilize a "capture" 

nucleic acid covalently immobilized to a solid support and a labeled "signal" nucleic acid in 
solution. The biological or clinical sample will provide the target nucleic acid. The 
"capture" nucleic acid and "signal" nucleic acid probe hybridize with the target nucleic acid 
to form a "sandwich" hybridization complex. To be effective, the signal nucleic acid cannot 
25 hybridize with the capture nucleic acid. 

In one embodiment, CLASP-2 polypeptides or polynucleotides are useful in 
treating deficiencies or disorders of the immune system, by activating or inhibiting the 
activation, differentiation of immune cells. Immune cells develop through a process called 
hematopoiesis, producing myeloid (platelets, red blood cells, neutrophils, and macrophages) 
30 and lymphoid (B and T lymphocytes) cells from pluripotent stem cells. The etiology of these 
immune deficiencies or disorders can be genetic, somatic, such as cancer or some 
autoimmune disorders, acquired (e.g., by chemotherapy or toxins), or infectious. 

In another embodiment, CLASP-2 polynucleotides or polypeptides are useful 
in treating or detecting deficiencies or disorders of hematopoietic cells. CLASP-2 
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polypeptides or polynucleotides could be used to increase differentiation and proliferation of 
hematopoietic cells, including the pluripotent stem cells, in an effort to treat those disorders 
associated with a decrease in certain (or many) types hematopoietic cells. Examples of 
immunologic deficiency syndromes include, but are not limited to: blood protein disorders 
5 (e .g., agammaglobulinemia, dysgammaglobulinemia), ataxia telangiectasia, common variable 
immunodeficiency, Digeorge Syndrome, HIV infection, HTLV-BLV infection, leukocyte 
adhesion deficiency syndrome, lymphopenia, phagocyte bactericidal dysfunction, severe 
combined immunodeficiency (SCIDs), Wiskott-Aldrich Disorder, anemia, thrombocytopenia, 
or hemoglobinuria. 

10 In one embodiment, CLASP-2 polynucleotides or polypeptides are useful in 

treating or detecting autoimmune diseases. The term "autoimmune disease" as used herein 
has the normal meaning in the art and refers to a spontaneous or induced malfunction of the 
immune system of mammals in which the immune system fails to distinguish between 
foreign immunogenic substances within the mammal and/or autologous ("self) substances 

15 and, as a result, treats autologous ("self) tissues and substances as if they were foreign and 
mounts an immune response against them. Autoimmune disease is characterized by 
production of either antibodies that react with self tissue, and/or the activation of immune 
effector T cells that are autoreactive to endogenous self antigens. Three main 
immunopathologic mechanisms act to mediate autoimmune diseases: 1) autoantibodies are 

20 directed against functional cellular receptors or other cell surface molecules, and either 
stimulate or inhibit specialized cellular function with or without destruction of cells or 
tissues; 2) autoantigen—autoantibody immune complexes form in intercellular fluids or in the 
general circulation and ultimately mediate tissue damage; and 3) lymphocytes produce tissue 
lesions by release of cytokines or by attracting other destructive inflammatory cell types to 

25 the lesions. These inflammatory cells in turn lead to production of lipid mediators and 
cytokines with associated inflammatory disease. 

Since many autoimmune disorders result from inappropriate recognition of 
self as foreign material by immune cells. This inappropriate recognition results in an immune 
response leading to the destruction of the host tissue. Therefore, the administration of 

30 CLASP-2 polypeptides or polynucleotides that can inhibit an immune response, particularly 
the proliferation, or differentiation of T-cells, can be an effective therapy in preventing 
autoimmune disorders. 

Examples of autoimmune disorders that can be treated or detected by CLASP- 
2 include, but are not limited to: Addison's Disease, hemolytic anemia, antiphospholipid 
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syndrome, rheumatoid arthritis, dermatitis, allergic encephalomyelitis, glomerulonephritis, 
Goodpasture's Syndrome, Graves' Disease, Multiple Sclerosis, Myasthenia Gravis, Neuritis, 
Ophthalmia, Bullous Pemphigoid, Pemphigus, Polyendocrinopathies, Purpura, Reiter's 
Disease, Stiff-Man Syndrome, Autoimmune Thyroiditis, Systemic Lupus Erythematosus, 
5 Autoimmune Pulmonary Inflammation, Guillain-Barre Syndrome, insulin dependent diabetes 
mel litis, and autoimmune inflammatory eye disease. 

Similarly, allergic reactions and conditions, such as asthma (particularly 
allergic asthma) or other respiratory problems, can also be treated by CLASP-2 polypeptides 
or polynucleotides. Moreover, CLASP-2 can be used to treat anaphylaxis or hypersensitivity 

10 to an antigenic molecules. 

In one embodiment CLASP-2 polynucleotides or polypeptides are used to treat 
and/or prevent organ rejection or graft- versus-host disease (GVHD). Organ rejection occurs 
by host immune cell destruction of the transplanted tissue through an immune response. 
Similarly, an immune response is also involved in GVHD, but, in this case, the foreign 

1 5 transplanted immune cells destroy the host tissues. The administration of CLASP-2 
polypeptides or polynucleotides that inhibits an immune response, particularly the 
proliferation, differentiation of T-cells, can be an effective therapy in preventing organ 

rejection or GVHD. 

Similarly, in another embodiment, CLASP-2 polypeptides or polynucleotides 
20 are used to modulate inflammation. The term "inflammation" refers to both acute responses 
(i.e., responses in which the inflammatory processes are active) and chronic responses (i.e., 
responses marked by slow progression and formation of new connective tissue). Acute and 
chronic inflammation can be distinguished by the cell types involved. Acute inflammation 
often involves polymorphonuclear neutrophils; whereas chronic inflammation is normally 
25 characterized by a lymphohistiocytic and/or granulomatous response. Inflammation includes 
reactions of both the specific and non-specific defense systems. A specific defense system 
reaction is a specific immune system reaction response to an antigen (possibly including an 
autoantigen). A non-specific defense system reaction is an inflammatory response mediated 
by leukocytes incapable of immunological memory. Such cells include granulocytes, 
30 macrophages, neutrophils and eosinophils. 

For example, CLASP-2 polypeptides or polynucleotides can inhibit the 
proliferation and differentiation of cells involved in an inflammatory response. These 
molecules can be used to treat inflammatory conditions, both chronic and acute conditions, 
including inflammation associated with infection (e.g., septic shock, sepsis, or systemic 
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inflammatory response syndrome (SIRS)), ischemia-reperfusion injury, endotoxin lethality, 
arthritis, complement-mediated hyperacute rejection, nephritis, cytokine or chemokine 
induced lung injury, inflammatory bowel disease, Crohn's disease, or resulting from over 
production of cytokines (e.g., TNF or IL-L). Examples of specific types of inflammation are 
diffuse inflammation, focal inflammation, croupous inflammation, interstitial inflammation, 
obliterative inflammation, parenchymatous inflammation, reactive inflammation, specific 
inflammation, toxic inflammation and traumatic inflammation. 

In another embodiment CLASP-2 polypeptides or polynucleotides are used to 
treat or detect infectious agents. For example, by increasing the immune response, 
particularly increasing the proliferation and differentiation of B and/or T cells, infectious 
diseases can be treated. The immune response can be increased by either enhancing an 
existing immune response, or by initiating a new immune response. CLASP-2 polypeptides 
or polynucleotides can be used to treat or detect any of these symptoms or diseases. 

5.4.2,2. Use of CLASP-2 Polynucleotides in Screening 

The presence or absence of hCLASP-2 nucleotide and amino acid sequences 
in a biological sample can be used in screening assays as medical diagnostics to aid in clinical 
decision-making. In one embodiment, hCLASP-2-based diagnostics involves screening 
assays for vaginal bleeding of unknown cause. In several examples discussed below, the 
cause of the bleeding can be in part differentiated by knowledge of whether the vaginal 
bleeding contains placental components (Hart FD, Ed., 1985, French's Index of Differential 
Diagnosis, 12th Ed. John Wright & Sons, pp. 561-63). In these cases, the high expression of 
hCLASP-2 nucleotide sequences in placenta relative to its low expression in blood (FIG. 4A) 
will allow the detection of the presence of placenta based on the presence of the hCLASP-2 
nucleotide or protein. Such detection can be achieved by quantitative RT-PCR, Northern 
analysis., Western analysis, ELISAs, and fluorescence activated cell sorting (FACS) by using 
labeled anti-hCLASP-2 antibodies (Sambrook et aL 9 1989, Molecular Cloning, 2nd Ed., Cold 
Spring Harbor Lab. Press; Harlow et. al., 1988, Antibodies, a laboratory manual, Cold Spring 
Harbor Lab. Press). 

For example, hCLASP-2 can be used in the following screening assays: 

(1) A woman gives birth and presents with post-partum bleeding. In 
this case the presence of placental tissue indicates a condition called "retained products of 
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conception" that requires surgical evacuation of the uterus (Dechemey and Pernol, Eds., 
1996, Current Obstetric & Gynecologic Diagnosis & Treatment, 8th Ed. McGraw Hill). 

(2) A pregnant woman suffers from vaginal bleeding of unknown 
origin. In this case the presence of placenta] tissue indicates a condition called "threatened 
abortion" that implies a poor prognosis for carrying the fetus to term (Decherney and Pernol, 
Eds., 1 996, Current Obstetric & Gynecologic Diagnosis & Treatment, 8th Ed. McGraw Hill). 

(3) A woman of child bearing age presents with vaginal bleeding and is 
found to have a positive pregnancy test without evidence of an intra-uterine pregnancy. In 
this case, the most serious of the differential diagnoses is ectopic pregnancy, a medical 
emergency. However, another common diagnosis is a completed abortion or miscarriage. The 
presence of products of conception (Le. placenta) in the vaginal bleeding strongly favors the 
diagnosis of completed abortion over that of ectopic pregnancy (Decherney and Pernol, Eds., 
1996, Current Obstetric & Gynecologic Diagnosis & Treatment, 8th Ed. McGraw Hill). 

In another embodiment, hCLASP-2-based diagnostics involve screening 
assays to determine injury to vital tissues that express hCLASP-2 at high levels. Such tissues 
include kidney, heart, and lung (Fig 4A). Injury to these tissues can result in leakage of cells 
and cellular constituents including hCLASP-2 into surrounding fluids (specified below). 
Detection of abnormally high levels of hCLASP-2 protein in these surrounding fluids by 
Western analysis or ELISA, or detection of abnormally high levels of hCLASP-2 RNA in 
these fluids by RT-PCR or Northern analysis is expected to aid in the diagnosis of tissue 
injury. 

In the case of renal injury, the hCLASP-2 nucleotide or amino acid sequences 
or fragments thereof would be expected to appear in the urine. Detection of abnormally high 
levels of hCLASP-2 can aid in the diagnosis of both nephritis and tubular necrosis, and 
differentiate from non-renal causes of proteinuria. Early diagnosis of nephritis is of particular 
value in patients with clinical signs and symptoms suggestive of systemic lupus 
erythematosis in whom early diagnosis and treatment of lupus nephritis can prevent 
irreversible kidney damage (Cameron J.S., 1999, J Nephrol 12 Suppl 2: S29-41). While 
tubular necrosis currently cannot be reversed by pharmacotherapy, differentiation of tubular 
necrosis from pre-renal failure is critical in formulating a treatment plan for oligouric 
hospitalized patients (Bidani A. and Churchill P.C., 1989, Dis Mon 35: 57-132). 

In the case of myocardial injury, the hCLASP-2 nucleic or amino acid 
sequence or fragments thereof are expected to appear in the blood. This is analogous to 
current standard practice of monitoring for other elevated levels myocardial proteins (e.g., 
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creatine kinase, troponin) in the blood following myocardial infarction and ischemia by 
standard ELISA or electrophoretic methodologies (Fauci et al., (eds.), 1998, Harrison's 
Principles of Internal Medicine, 14th Ed., McGraw Hill, pp. 1352-1375). The presence of 
hCLASP-2 in cardiac muscle and its absence in skeletal muscle and blood makes hCLASP-2 
an ideal marker to diagnose and monitor myocardial injury. 

Unlike myocardial injury, pulmonary injury is not routinely diagnosed by 
assaying serum for lung-specific proteins. By analogy to myocardial infarction, pulmonary 
infarction also releases lung-specific proteins and cells into systemic circulation. Pulmonary 
infarction due to pulmonary embolism (PE) or pneumonia is expected to release hCLASP-2- 
bearing cells or protein/peptides into systemic circulation. Detection of hCLASP-2 protein in 
serum or RNA in blood can aid in the diagnosis of pulmonary infarction in the appropriate 
clinical setting. Current methods to diagnose PE are not only expensive but lack specificity 
and sensitivity, and the misdiagnosis of this condition is a leading cause of preventable death 
in hospitalized patients (Raskob G.E. and Hull R.D., 1999, Curr Opin Hematol. 6(5): 280-4). 

In another embodiment, hCLASP-2-based diagnostics involve screening 
assays for identifying disorders of cells of hematopoietic lineage. hCLASP-2 is expressed in 
human T cells, B cells but not cells from the myeloid lineage. Different hCLASP-2 isoforms 
in T and B cells permit further discrimination between malignancies of T and B lineage (FIG. 
4B). Precise identification of hematopoietic cell types is vital to guide chemotherapy and 
radiation therapy of patients with leukemia and lymphoma (Fauci et al Eds., 1998, Harrison's 
Principles of Internal Medicine, 14th Ed. McGraw Hill, pp. 695-712). hCLASP-2 expression 
differences can be detected by using FACS, immunofluorescence, immunoperoxidase 
staining, RT-PCR, in situ hybridization or RNA blot analysis (Sambrook, Ffitsch and 
Maniatas, Molecular Cloning, 2nd Ed. Cold Spring Harbor Lab. Press, 1989; Ward MS, 
25 Pathology 1 999 Nov; 3 1(4): 382-92). 

In another embodiment, hCLASP-2-based diagnostics involve screening 
assays for identifying activated immune system cells. Although hCLASP-2 is generally 
expressed at quite low levels in PBMCs (which is critical for some of the above applications), 
it is known that the surface expression of the closely related mouse CLASP- 1 protein is 
30 altered during the process of lymphocyte activation. An analogous change in expression is 
expected for the hCLASP-2 protein. Subtyping lymphocytes specific for a particular antigen, 
for example, using. MHC-based multimeric staining reagents (Altaian et. al., 1996, Science 
274: 94-6), for separating cell populations into hCLASP-2 high and hCLASP-2 low 
populations, can aid in determining the nature of the immune response against that antigen. 


20 
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Such understanding is critical, for example, in predicting the course of chronic viral 
infections such as hepatitis B, hepatitis C, and HIV, and to designing appropriate treatment 
regimens for patients suffering from these infections. 

hCLASP-2 can also serve as a potential therapeutic agent for Wilms* tumor. 
Wilms tumor is the most common primary renal tumor of childhood (Cotran, Kumar, and 
Collins, 1999, Robbins Pathologic Basis of Disease, 6th Ed. W.B. Saunders, pp. 487-89). As 
discussed herein, hCLASP-2 is highly expressed in 293 cells, embryonic kidney epithelial 
cells. Therefore, hCLASP-2 nucleic or amino acid sequence or fragments can serve as tumor 
markers for Wilms' tumor. Antibodies directed against a hCLASP-2 variant that is expressed 
only in Wilms* tumor can serve as novel therapeutic agents for Wilms* tumor, and can also 
function as delivery vehicles for other targeted therapeutics that may be attached to the anti- 
hCLASP-2 antibody (e.g., chemotherapeutics or radiolabeling). 

5.4,2.2.1. CLASP-2 Antisense, Ribozyme and Triplex Polynucleotides and Methods 

of Use 

Oligonucleotide sequences, that include anti-sense RNA and DNA molecules 
and ribozymes that function to inhibit the translation of a CLASP-2 mRNA are within the 
scope of the invention. Such molecules are useful in cases where downregulation of CLASP- 
2 expression is desired. Anti-sense RNA and DNA molecules act to directly block the 
translation of mRNA by binding to targeted mRNA and preventing protein translation. The 
invention provides methods and antisense oligonucleotide or polynucleotide reagents which 
can be used to reduce expression of CLASP-2 gene products in vitro or in vivo. 
Administration of the antisense reagents of the invention to a target cell results in reduced 
CLASP activity. As will be apparent to one of skill and as discussed supra (Table 3), 
specific CLASP-2 splice variants can be specifically targeted for inhibition. Alternatively, by 
designing an, e.g., antisense molecule that recognizes a sequence found in several or all 
CLASP-2 species, a general inhibition can be achieved. 

A. Antisense 

Without intending to be limited to any particular mechanism, it is believed that 
antisense oligonucleotides bind to, and interfere with the translation of, the sense CLASP-2 
mRNA. Alternatively, the antisense molecule can render the CLASP-2 mRNA susceptible to 
nuclease digestion, interfere with transcription, interfere with processing, localization or 
otherwise with RNA precursors ("pre-mRNA"), repress transcription of mRNA from the 
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CLASP-2 gene, or act through some other mechanism. However, the particular mechanism 

by which the antisense molecule reduces CLASP-2 expression is not critical. 

The antisense polynucleotides of the invention comprise an antisense sequence 

of at least 7 to 10 to typically 20 or more nucleotides that specifically hybridize to a sequence 

5 from mRNA encoding CLASP-2 or mRNA transcribed from the CLASP-2 gene. More often, 

the antisense polynucleotide of the invention is from about 10 to about 50 nucleotides in 

iengih or from about 14 to about 35 nucleotides in length. In other embodiments, antisense 

polynucleotides are polynucleotides of less than about 100 nucleotides or less than about 200 

nucleotides. In general, the antisense polynucleotide should be long enough to form a stable 

10 duplex but short enough, depending on the mode of delivery, to administer in vivo, if desired. 

The minimum length of a polynucleotide required for specific hybridization to a target 

sequence depends on several factors, such as G/C content, positioning of mismatched bases 

(if any), degree of uniqueness of the sequence as compared to the population of target 

# 

polynucleotides, and chemical nature of the polynucleotide (e.g., methylphosphonate 

15 backbone, peptide nucleic acid, phosphorothioate), among other factors. Generally, to. assure 
specific hybridization, the antisense sequence is substantially complementary to the target 
CLASP-2 mRNA sequence. In certain embodiments, the antisense sequence is exactly 
complementary to the target sequence. The antisense polynucleotides can also include, 
however, nucleotide substitutions, additions, deletions, transitions, transpositions, or 

20 modifications, or other nucleic acid sequences or non-nucleic acid moieties so long as 

specific binding to the relevant target sequence corresponding to CLASP-2 RNA or its gene 
is retained as a functional property of the polynucleotide. 

It will be appreciated that the CLASP-2 polynucleotides and oligonucleotides 
of the invention can be made using nonstandard bases (e.g., other than adenine, cytidine, 

25 guanine, thymine, and uridine) or nonstandard backbone structures to provides desirable 
properties (e.g., increased nuclease-resistance, tighter-binding, stability or a desired TM). 
Techniques for rendering oligonucleotides nuclease-resistant include those described in PCT 
publication WO 94/12633. A wide variety of useful modified oligonucleotides may be 
produced, including oligonucleotides having a peptide-nucleic acid (PNA) backbone (Nielsen 

30 et ai 9 1991, Science 254: 1497) or incorporating 2 , -0-methyl ribonucleotides, 

phosphorothioate nucleotides, methyl phosphonate nucleotides, phosphotriester nucleotides, 
phosphorothioate nucleotides, phosphoramidates. Still other useful oligonucleotides may 
contain alkyl and halogen-substituted sugar moieties .comprising one of the following at the 
2' position: OH, SH, SCH3, F, OCN, OCH30CH3, OCH30(CH2)nCH3, 0(CH2)nNH2 or 
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0(CH2)nCH3, where n is from 1 to about 10; CI to CIO lower alkyl, substituted lower alkyl, 
alkaryl or aralkyl; CI; Br; CN; CF3; OCF3; O-, S-, or N-alkyl; O-, or N-alkenyl; SOCH3 ; 
S02CH3; ON02; N02; N3; NH2; heterocycloalkyl; heterocycloalkaryl; aminoalkylamino; 
polyalkylamino; substituted silyl; an RNA cleaving group; a cholesteryl group; a folate 
group; a reporter group; an intercalator; a group for improving the pharmacokinetic properties 
of an oligonucleotide; or a group for improving the pharmacodynamic properties of an 
oligonucleotide and other substituents having similar properties. Folate, cholesterol or other 
groups that facilitate oligonucleotide uptake, such as lipid analogs, may be conjugated 
directly or via a linker at the V position of any nucleoside or at the 3' or 5* position of the 3'- 
terminal or SMerminal nucleoside, respectively. One or more such conjugates may be used. 
Oligonucleotides may also have sugar mimetics such as cyclobutyls in place of the 
pentofuranosyl group. Other embodiments may include at least one modified base form or 
"universal base" such as inosine, or inclusion of other nonstandard bases such as queosine 
and wybutosine as well as acetyl-, methyl-, thio- and similarly modified forms of adenine, 
cytidine, guanine, thymine, and uridine which are not as easily recognized by endogenous 
endonucleases. The antisense oligonucleotide can comprise at least one modified base moiety 
which is selected from the group including, but not limited to, 5-fluorouracil, 5-bromouracil, 
5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 4-acetylcytosine, 
5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyi-2-thiouridine, 
5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, 
N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 
2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 
7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta- 
D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-N6- 
isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, queosine, 

2- thiocytosine, 5-methyl-2-thiouracil, 2-thiouraciI, 4-thiouracil, 5-methyluracil, uracil- 
5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 3-(3-amino- 

3- N-2-carboxypropyl) uracil, (acp3)w, and 2,6-^iaininopiirine. 

The invention further provides oligonucleotides having backbone analogues 
such as phosphodiester, phosphorothioate, phosphorodithioate, methylphosphonate, 
phosphoramidate, alkyl phosphotriester, sulfamate, 3'-thioacetal, methylene(methylimino), 
3'-N-carbamate, morpholino carbamate, chiral-methyl phosphonates, nucleotides with short 
chain alkyl or cycloalkyl intersugar linkages, short chain heteroatomic or heterocyclic 
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inlersugar ("backbone") linkages, or CH2-NH-0-CH2, CH2-N(CH3)-OCH2, CH2-0- 
N(CH3)-CH2, CH2-N(CH3)-N(CH3)-CH2 and 0-N(CH3)-CH2-CH2 backbones (where 
phosphodiester is 0-P-0-CH2), or mixtures of the same. Also useful are oligonucleotides 
having morpholino backbone structures (U.S. Patent No. 5,034,506). 
5 Useful references include Oligonucleotides and Analogues, A Practical 

Approach, edited by F. Eckstein, IRL Press at Oxford University Press (1991); Antisense 
Strategies, Annals of the New York Academy of Sciences, Volume 600, Eds. Baserga and 
Denhardt (NY AS 1992); Milligan et al. y 9 July 1993, J. Med. Chem, 36(14): 1923-1937; 
Antisense Research and Applications (1993, CRC Press), in its entirety and specifically 

10 Chapter 15, by Sanghvi, entitled "Heterocyclic base modifications in nucleic acids and their 
applications in antisense oligonucleotides;" and Antisense Therapeutics, ed. Sudhir Agrawal 
(Humana Press, Totowa, New Jersey, 1996). 

In one embodiment, the antisense sequence is complementary to relatively 
accessible sequences of the CLASP-2 mRNA (e.g., relatively devoid of secondary structure). 

15 This can be determined by analyzing predicted RNA secondary structures using, for example, 
the MFOLD program (Genetics Computer Group, Madison WI) and testing in vitro or in vivo 
as is known in the art. Another useful method for identifying effective antisense compositions 
uses combinatorial arrays of oligonucleotides (see, e.g., Milner et al, 1997, Nature 
Biotechnology 15: 537). Examples of oligonucleotides that can be tested in cells for antisense 

20 suppression of CLASP-2 function are those capable of hybridizing to (Le. y substantially 
complementary to) the following positions from SEQUENCE ED NO: 1 : 

1 ) GAAGGCGATCATCACGTGGCCTTCC ATCGC 

2) GCTTCAAGTAATGACTGGTGCAGAACATCTG 

3) GCTCCTCCTCAGGCAGGCGCTATGGCTGTGG 
25 _ 4) GTAGGCCCGGTGCAGCGTGTCATACAGATGG 

(See also Example 8) . 

In some embodiments, administration of antisense oligonucleotides will result 
in reduction of hCLASP-mRNA expression by at least about 50%, as assessed by Northern 
analysis after administration of an antisense phosphorothioate oligonucleotide at a 
30 concentration of 1 pM, 5 pM, 10 pM or 20 pM. 

The invention also provides an antisense polynucleotide that has sequences in 
addition to the antisense sequence (i.e., in addition to anti-CLASP-2-sense sequence). In this 
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case, the antiscnse sequence is contained within a polynucleotide oflonger sequence. In 
another embodiment, the sequence of the polynucleotide consists essentially of, or is, the 
antisense sequence. 

The antiscnse nucleic acids (DNA, RNA, modified, analogues, and the like) 

5 can be made using any suitable method for producing a nucleic acid, such as the chemical 
synthesis and recombinant methods disclosed herein. In one embodiment, for example, 
antisense RNA molecules of the invention can be prepared by de novo chemical synthesis or 
by cloning For example, an antisense RNA that hybridizes to CLASP-2 mRNA can be made 
by inserting (hgatmg) an CLASP-2 DNA sequence {e.g., SEQUENCE ED No: 1, or fragment 

10 thereof) in reverse orientation operably linked to a promoter in a vector {e.g., plasmid). 
Provided that the promoter and, preferably termination and polyadenylation signals, are 
properly positioned, the strand of the inserted sequence corresponding to the noncoding 
strand will be transcribed and act as an antisense oligonucleotide of the invention. The term 
"operably linked" refers to a functional linkage between a nucleic acid expression control 

1 5 sequence (such as a promoter or enhancer) and a second nucleic acid sequence, wherein the 
expression control sequence directs transcription of the nucleic acid corresponding to the 
second sequence. 

In one embodiment, antisense DNA oligodeoxyribonucleotides derived from 
the translation initiation site, e.g., between -10 and +10 regions of a CLASP-2 nucleotide 
20 sequence, are used. For general methods relating to antisense polynucleotides, see Antisense 
RNA AND DNA, 1988, D.A. Melton, Ed., Cold Spring Harbor Laboratory, Cold Spring 
Harbor, NY). See also, Daglee* al 9 1991, Nucleic Acids Research, 19: 1805. For a review 
of antisense therapy, see, e.g., Uhlmann et aL, 1990, Chem. Reviews, 90: 543-584. 

B. Ribozvme 

25 Ribozymes are enzymatic RNA molecules capable of catalyzing the specific 

cleavage of RNA. The mechanism of ribozyme action involves sequence specific 
hybridization of the ribozyme molecule to complementary target RNA, followed by 
endonucleolytic cleavage. Within the scope of the invention are engineered hammerhead 
motif ribozyme molecules that specifically and efficiently catalyze endonucleolytic cleavage 

30 of CLASP-2 RNA sequences. 

Specific ribozyme cleavage sites within any potential RNA target are initially 
identified by scanning the target molecule for ribozyme cleavage sites which include the 
following sequences, GUA, GUU and GUC. Once identified, short RNA sequences of 
between 15 and 20 ribonucleotides corresponding to the region of the target gene containing 
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the cleavage site can be evaluated for predicted structural features such as secondary structure 
that can render the oligonucleotide sequence unsuitable. The suitability of candidate targets 
can also be evaluated by testing their accessibility to hybridization with complementary 
oligonucleotides, using ribonuclease protection assays. 
5 C. Triplex 

Alternatively, endogenous target gene expression can be reduced by targeting 
deoxyribonucleotide sequences complementary to the regulatory region of the target gene 
(i.e., the target gene promoter and/or enhancers) to form triple helical structures that prevent 
transcription of the target gene in target cells in the body. (See generally, Helene, 1991 , 

1 0 Anticancer Drug Des., 6(6): 569-584; Helene et al, 1 992, Ann. N.Y. Acad. Sci., 660: 27-36; 
and Maher, 1992, Bioassays 14(12): 807-815). 

Nucleic acid molecules to he used in triplex helix formation for the inhibition 
of transcription should be single stranded and composed of deoxynucleotides. The base 
composition of these oligonucleotides must be designed to promote triple helix formation via 

1 5 Hoogsteen base pairing rules, which generally require sizable stretches of either purines or 
pyrimidines to be present on one strand of a duplex. Nucleotide sequences can be 
pyrimidine-based, which will result in TAT and CGC + triplets across the three associated 
strands of the resulting triple helix. The pyrimidine-rich molecules provide base 
cornplementarily to a purine-rich region of a single strand of the duplex in a parallel 

20 orientation to that strand. In addition, nucleic acid molecules can be chosen that are purine- 
rich, for example, contain a stretch of G residues. These molecules will form a triple helix 
with a DNA duplex that is rich in GC pairs, in which the majority of the purine residues are 
located on a single strand of the targeted duplex, resulting in GGC triplets across the three 
strands in the triplex. 

25 Alternatively, the potential sequences that can be targeted for triple helix 

formation can be increased by creating a so called "switchback" nucleic acid molecule. 
Switchback molecules are synthesized in an alternating 5*-3\ 3'-5* manner, such that they 
base pair with first one strand of a duplex and then the other, eliminating the necessity for a 
sizable stretch of either purines or pyrimidines to be present on one strand of a duplex. 

30 D. Genera] 

The anti-sense RNA and DNA molecules, ribozymes and triple helix 
molecules of the invention can be prepared by any method known in the art for the synthesis 
of RNA molecules. These include techniques for chemically synthesizing 
oligodeoxyribonucleotides well known in the art such as for example solid phase 
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phosphoramidite chemical synthesis. Alternatively, RNA molecules can be generated by in 
vitro and in vivo transcription of DNA sequences encoding the antisense RNA molecule. 
Such DNA sequences can be incorporated into a wide variety of vectors which contain 
suitable RNA polymerase promoters such as the T7 or SP6 polymerase promoters. 

5 Alternatively, antisense cDNA constructs that synthesize antisense RNA constitutively or 
inducibly, depending on the promoter used, can be introduced stably into cell lines. 

Various modifications to the DNA molecules can be introduced as a means of 
increasing intracellular stability and half-life. Possible modifications include, but are not 
limited to, the addition of flanking sequences of ribo- or deoxy- nucleotides to the 5* and/or 

10 3' ends of the molecule or the use of phosphorothioate or 2' O-methyl rather than 
phosphodiesterase linkages within the oligodeoxyribonucleotide backbone. 

Methods for introducing polynucleotides into such cells or tissue include 
methods for in vitro introduction of polynucleotides such as the insertion of naked 
polynucleotide, i.e., by injection into tissue, the introduction of a CLASP-2 polynucleotide in 

1 5 a cell ex vivo, the use of a vector such as a virus, {e.g., a retrovirus, adenovirus, adeno- 
associated virus, and the like), phage or plasmid, and the like or techniques such as 
electroporation or calcium phosphate precipitation. 

5.4.2.2.2. Gene Therapy 

By introducing gene sequences into cells, gene therapy can be used to treat 
20 conditions in which the cells do not express normal CLASP-2 or express abnormal/inactive 
CLASP-2. In some instances, the polynucleotide encoding a CLASP-2 is intended to replace 
or act in the place of a functionally deficient endogenous gene. Alternatively, abnormal 
conditions characterized by overexpression can be treated using the gene therapy techniques 
described below. 

25 In a specific embodiment, nucleic acids comprising a sequence encoding a 

CLASP-2 protein or functional derivative thereof, are administered to promote CLASP-2 
function, by way of gene therapy. Gene therapy refers to therapy performed by the 
administration of a nucleic acid to a subject In this embodiment of the invention, the nucleic 
acid produces its encoded protein that mediates a therapeutic effect by promoting CLASP-2 

30 function. 

Any of the methods for gene therapy available in the art can be used according 
to the present invention. Exemplary methods are described below. 
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For general reviews of the methods of gene therapy, see, GoldspieJ-e/ al., 
1993, Clinical Pharmacy 12: 488-505; Wu and Wu, 1991, Biotherapy 3: 87-95; Tolstoshev, 
1993, Ann. Rev. Pharmacol. Toxicol. 32: 573-596; Mulligan, 1993, Science 260: 926-932; 
and Morgan and Anderson, 1993, Ann. Rev. Biochem. 62: 191-217; Can, 1993, TIBTECH 
5 1 1 (5): 1 55-21 5). Methods commonly known in the art of recombinant DNA technology 

which can be used are described in Ausubel et al. y supra; and Kriegler, 1990, Gene Transfer 
and Expression, A Laboratory Manual, Stockton Press, NY. 

In one aspect, the therapeutic composition comprises a CLASP-2 nucleic acid 
that is part of an expression vector that encodes a CLASP-2 protein or fragment or chimeric 

1 0 protein thereof in a suitable host. In particular, such a nucleic acid has a promoter operably 
linked to the CLASP-2 coding region, said promoter being inducible or constitutive, and, 
optionally, tissue-specific. In another particular embodiment, a nucleic acid molecule is used 
in which the CLASP-2 coding sequences and any other desired sequences are flanked by 
regions that promote homologous recombination at a desired site in the genome, thus 

15 providing for intrachromosomal expression of the CLASP-2 nucleic acid (Koller and 

Smithies, 1989, Proc. Natl. Acad. Sci. U.S.A. 86: 8932-8935; Zijlstra et al, 1989, Nature 
342: 435-438). 

Delivery of the nucleic acid into a patient can be either direct, in which case 
the patient is directly exposed to the nucleic acid or nucleic acid-carrying vector, or indirect, 
20 in which case, cells are first transformed with the nucleic acid in vitro, then transplanted into 
the patient. These two approaches are known, respectively, as in vivo or ex vivo gene 
therapy. 

In a specific embodiment, the nucleic acid is directly administered in vivo 9 
where it is expressed to produce the encoded product. This can be accomplished by any of 

25 numerous methods known in the art, e.g., by constructing it as part of an appropriate nucleic 
acid expression vector and administering it so that it becomes intracellular, e.g., by infection 
using a defective or attenuated retroviral or other viral vector (see, U.S. Patent No. 
4,980,286), or by direct injection of naked DNA, or by use of microparticle bombardment 
(e.g., a gene gun; Biolistic, Dupont), or coating with lipids or cell-surface receptors or 

30 transfecting agents, encapsulation in liposomes, microparticles, or microcapsules, or by 

administering it in linkage to a peptide which is known to enter the nucleus, by administering 
it in linkage to a ligand subject to receptor-mediated endocytosis (see, e.g., Wu and Wu. 
1987, J. Biol. Chem. 262: 4429-4432) (which can be used to target cell types specifically 
expressing the receptors), and the like. In another embodiment, a nucleic acid-ligand 
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complex can be formed in which the ligand comprises a fusogenic viral peptide to disrupt 
endosomes, allowing the nucleic acid to avoid lysosomal degradation. In yet another 
embodiment, the nucleic acid can be targeted in vivo for cell specific uptake and expression, 
by targeting a specific receptor (see, e.g., PCT Publications WO 92/06180 dated April 16, 
1992; WO 92/22635 dated December 23, 1992; WO 92/203 1 6 dated November 26, 1992; 
WO 93/141 88 dated July 22, 1993; WO 93/20221 dated October 14, 1993). Alternatively, 
the nucleic acid can be introduced intracellularly and incorporated within host cell DNA for 
expression, by homologous recombination (Koller and Smithies, 1989, Proc. Natl. Acad. Sci. 
U.S.A. 86: 8932-8935; Zijlstra et al, 1989, Nature 342: 435-438). 

In a specific embodiment, a viral vector that contains the CLASP-2 nucleic 
acid is used. For example, a retroviral vector can be used (see, Miller et al, 1993, Meth. 
Enzymol. 217: 581-599). These retroviral vectors have been modified to delete retroviral 
sequences that are not necessary for packaging of the viral genome and integration into host 
cell DNA. The CLASP-2 nucleic acid to be used in gene therapy is cloned into the vector, 
which facilitates delivery of the gene into a patient. More detail about retroviral vectors can 
be found in Boesen et al, 1994, Biotherapy 6: 291-302, which describes the use of a 
retroviral vector to deliver the mdrl gene to hematopoietic stem cells in order to make the 
stem cells more resistant to chemotherapy. Other references illustrating the use of retroviral 
vectors in gene therapy are: Clowes et al, 1994, J. Clin. Invest. 93: 644-651; Kiem et al, 
1994, Blood S3: 1467-1473; Salmons and Gunzberg, 1993, Human Gene Therapy 4: 129- 
141; and Grossman and Wilson, 1993, Curr. Opin. in Genetics and Devel. 3: 110-1 14. 

Adenoviruses are other viral vectors that can be used in gene therapy. 
Adenoviruses are especially attractive vehicles for delivering genes to respiratory epithelia. 
Adenoviruses naturally infect respiratory epithelia where they cause a mild disease. Other 
targets for adenovirus-based delivery systems are liver, the central nervous system, 
endothelial cells, and muscle. Adenoviruses have the advantage of being capable of infecting 
non-dividing cells. Kozarsky and Wilson 1993, Current Opinion in Genetics and 
Development 3: 499-503) present a review of adenovirus-based gene therapy. Bout et al, 
1994, Human Gene Therapy 5: 3-10, demonstrated the use of adenovirus vectors to transfer 
genes to the respiratory epithelia of rhesus monkeys. Other instances of the use of 
adenoviruses in gene therapy can be found in Rosenfeld et al, 1991, Science 252: 431-434; 
Rosenfeld et al, 1992, Cell 68: 143-155; and Mastrangeli et al, 1993, J. Clin. Invest. 91: 
225-234. Adeno-associated virus (AAV) has also been proposed for use in gene therapy 
(Walsh et al, 1993, Proc. Soc. Exp. Biol. Med. 204: 289-300. 
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Another approach to gene therapy involves transferring a gene to Gells in 
tissue culture by such methods as electroporation, lipofection, calcium phosphate mediated 
transfection, or viral infection. Usually, the method of transfer includes the transfer of a 
selectable marker to the cells. The cells are then placed under selection to isolate those cells 
5 that have taken up and are expressing the transferred gene. Those cells are then delivered to a 
patient. 

In this embodiment, the nucleic acid is introduced into a cell prior to 
administration in vivo of the resulting recombinant cell. Such introduction can be carried out 
by any method known in the art, including but not limited to transfection, electroporation, 

10 microinjection, infection with a viral or bacteriophage vector containing the nucleic acid 
sequences, cell fusion, chromosome-mediated gene transfer, microcell-mediated gene 
transfer, spheroplast fusion, and the like. Numerous techniques are known- in the art for the 
introduction of foreign genes into cells (see, e.g., Loeffler and Behr, 1993, Meth. Enzymol. 
217: 599-618; Cohen et al, 1993, Meth. Enzymol. 217: 618-644; Cline, 1985, Pharmac. 

15 Ther. 29: 69-92) and can be used in accordance with the present invention, provided that the 
necessary developmental and physiological functions of the recipient cells are not disrupted. 
The technique should provide for the stable transfer of the nucleic acid to the cell, so that the 
nucleic acid is expressible by the cell and preferably heritable and expressible by its cell 
progeny. 

20 The resulting recombinant cells can be delivered to a patient by various 

methods known in the art. In a preferred embodiment, epithelial cells are injected, e.g., 
subcutaneously. In another embodiment, recombinant skin cells can be applied as a skin graft 
onto the patient. Recombinant blood cells (e.g., hematopoietic stem or progenitor cells) are 
preferably administered intravenously. The amount of cells envisioned for use depends on 

25 the desired effect, patient state, and the like., and can be determined by one skilled in the art. 

Cells into which a nucleic acid can be introduced for purposes of gene therapy 
encompass any desired, available cell type, and include but are not limited to epithelial cells, 
endothelial cells, keratinocytes, fibroblasts, muscle cells, hepatocytes; blood cells such as T 
lymphocytes, B lymphocytes, monocytes, macrophages, neutrophils, eosinophils, 

30 megakaryocytes, granulocytes; various stem or progenitor cells, in particular hematopoietic 
stem or progenitor cells, e.g., as obtained from bone marrow, umbilical cord blood, peripheral 
blood, fetal liver, and the like. In a preferred embodiment, the cell used for gene therapy is 
autologous to the patient. 
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In a* specific embodiment, the nucleic acid to be introduced for purposes of 
gene therapy comprises an inducible promoter operably linked to the coding region, such that 
expression of the nucleic acid is controllable by controlling the presence or absence of the 
appropriate inducer of transcription. 

5 5,4.2.3. Knockout Cells 

In one aspect of the invention, endogenous target gene expression can also be 
reduced by inactivating or "knocking out" the target gene or its promoter using targeted 
homologous recombination (see, e.g.. Smithies et al t 1985, Nature 317: 230-234; Thomas 
and Capecchi, 1987, Cell 51: 503-512; Thompson et aL. 1989, Cell 5: 313-321; each of 
1 0 which is incorporated by reference herein in its entirety). For example, a mutant, non- 
functional target gene (or a completely unrelated DNA sequence) flanked by DNA 
homologous to the endogenous target gene (either the coding regions or regulatory regions of 
the target gene) can be used, with or without a selectable marker and/or a negative selectable 
marker, to transfect cells that express the target gene in vivo. Insertion of the DNA construct, 
1 5 via targeted homologous recombination, results in inactivation of the target gene. Such 
approaches are particularly suited in the agricultural field where modifications to ES 
(embryonic stem) cells can be used to generate animal offspring with an inactive target gene 
(see, e.g., Thomas and Capecchi, 1987 and Thompson, 1989, supra). However, this approach 
can be adapted for use in humans provided the recombinant DNA constructs are directly 
20 administered or targeted to the required site in vivo using appropriate viral vectors. 

5.4.2.4. Transgenic and Knockout Animals 

The CLASP-2 gene product can also be expressed in transgenic animals. 
Animals of any species, including, but not limited to, mice, rats, rabbits, guinea pigs, pigs, 
micro-pigs, goats, sheep, and non-human primates, e.g., baboons, monkeys, and chimpanzees 

25 can be used to generate CLASP-2 transgenic animals. The term "transgenic," as used herein, 
refers to animals expressing CLASP-2 gene sequences from a different species (e.g. t mice 
expressing human CLASP-2 gene sequences), as well as animals that have been genetically 
engineered to overexpress endogenous (i.e., same species) CLASP-2 sequences or animals 
that have been genetically engineered to no longer express endogenous CLASP-2 gene 

30 sequences (i.e. , 'Toiock-ouf animals), and their progeny. 

Any technique known in the art can be used to introduce a CLASP-2 transgene 
into animals to produce the founder lines of transgenic animals. Such techniques include, but 
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are not limited to pronuclear microinjection (Hoppe and Wagner, 1989, U.S. Pat.-No. 
4,873,191); retrovirus mediated gene transfer into germ lines (Van der Putten et al f 1985, 
Proc. Natl. Acad. Sci., U.S.A. 82: 6148-6152); gene targeting in embryonic stem cells 
(Thompson et al, 1989, Cell 56: 313-321); electroporation of embryos (Lo, 1983, Mol. Ceil. 
5 Biol. 3: 1803-1814); and sperm-mediated gene transfer (Lavitrano et al, 1989, Cell 57: 717- 
723) (For a review of such techniques, see Gordon, 1989, Transgenic Animals, Intl. Rev. 
Cytol. 115, 171-229) 

Any technique known in the art can be used to produce transgenic animal 
clones containing a CLASP-2 transgene, for example, nuclear transfer into enucleated 

10 oocytes of nuclei from cultured embryonic, fetal or adult cells induced to quiescence 
(Campbell et al, 1996, Nature 380: 64-66; Wilmut et al, Nature 385: 810-813). 

The present invention provides for transgenic animals that carry a CLASP-2 
transgene in all their cells, as well as animals that carry the transgene in some, but not all 
their cells, i.e., mosaic animals. The transgene can be integrated as a single transgene or in 

15 concatamers, e.g., head-to-head tandems or head-to-tail tandems. The transgene can also be 
selectively introduced into and activated in a particular cell type by following, for example, 
the teaching of Lasko et al (1992, Proc. Natl. Acad. Sci. U.S.A. 89: 6232-6236). The 
regulatory sequences required for such a cell-type specific activation will depend upon the 
particular cell type of interest, and will be apparent to those of skill in the art. When it is 

20 desired that the CLASP-2 transgene be integrated into the chromosomal site of the 

endogenous CLASP-2 gene, gene targeting is preferred. Briefly, when such a technique is to 
be utilized, vectors containing some nucleotide sequences homologous to the endogenous 
CLASP-2 gene are designed for the purpose of integrating, via homologous" recombination 
with chromosomal sequences, into and disrupting the function of the nucleotide sequence of 

25 the endogenous CLASP-2 gene. The transgene can also be selectively introduced into a 

particular cell type, thus inactivating the endogenous CLASP-2 gene in only that cell type, by 
following, for example, the teaching of Gu et al (1994, Science 265: 103-106). The 
regulatory sequences required for such a cell-type specific inactivation will depend upon the 
particular cell type of interest, and will be apparent to those of skill in the art. 

30 Once transgenic animals have been generated, the expression of the 

recombinant CLASP-2 gene can be assayed utilizing standard techniques. Initial screening 
can be accomplished by Southern blot analysis or PCR techniques to analyze animal tissues 
to assay whether integration of the transgene has taken place. The level of mRNA expression 
of the transgene in the tissues of the transgenic animals can also be assessed using techniques 
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that include, but are not limited to, Northern blot analysis of tissue samples obtained from the 
animal, in situ hybridization analysis, and RT-PCR (reverse transcriptase PCR). Samples of 
CLASP-2 gene-expressing tissue, can also be evaluated immunocytochemically using 
antibodies specific Tor the CLASP-2 transgene product. 

5 5.4.2.5. Other Uses of CLASP-2 Polynucleotides 

There exists an ongoing need to identify new chromosome marking reagents. 
Sequences can be mapped to chromosomes by preparing PCR primers from SEQ ID NO: 1 , 
3, 5, or 9. These primers can be can be less than 50 nucleotides in length, generally less than 
46 nucleotides, more generally less than 41 nucleotides, most generally less than 36 
1 0 nucleotides, preferably less than 3 1 nucleotides, more preferably less than 26 nucleotides, and 
most preferably less than 21 nucleotides in length. The probes can also be less than 16 
nucleotides, less than 13 nucleotides in length, less than 9 nucleotides in length and less than 
7 nucleotides in length. Primers can be selected so that the primers do not span more than 
one predicted exon in the genomic DNA. These primers are then used for PCR screening of 
15 somatic ceil hybrids containing individual human chromosomes (i.e., chromosome 13). Only 
those hybrids containing the human CLASP-2 gene corresponding to SEQ ID NO: 1, 3, 5, or 
9 will yield an amplified fragment. 

Similarly, somatic hybrids provide a rapid method of PCR mapping the 
polynucleotides to particular chromosomes. Precise chromosomal location of the CLASP-2 
20 polynucleotides can also be achieved using fluorescence in situ hybridization (FISH) of a 

metaphase chromosomal spread. See Verma, et al, Human Chromosomes: A Manual of Basic 
Techniques, Pergamon Press. NY, 1988. Once a polynucleotide has been "mapped to a 
precise chromosomal location, the physical position of the polynucleotide can be used in 
linkage analysis. Linkage analysis establishes coinheritance between a chromosomal location 
25 and presentation of a particular disease. See McKusick, V., 1998, Mendelian Inheritance in 
Man : A Catalog of Human Genes and Genetic Disorders, 12th Ed, Johns Hopkins University 
Press. 

The CLASP-2 polynucleotides can be used for identifying individuals from 
minute biological samples as DNA markers for restriction fragment length polymorphism 
30 (RFLP). An individual's genomic DNA is digested with one or more restriction enzymes, 
and probed on a Southern blot with CLASP-2 DNA markers to yield unique bands for 
identifying the individual. 
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As described above, upon sequencing of numerous independent cDNA 
products, single nucleotide polymorphisms (SNPs) have been discovered within CLASP-2. 
These alterations and differences are presented in FIG 1 1 B. They represent mis-sense 
alterations. 

5 If it is determined that certain SNPs are deleterious or advantageous, SNPs can 

be used as a diagnostic tool through SNP mapping or direct sequencing of the SNP region to 
determine which isoform is expressed. Additionally, the SNPs can be used as a general SNP 
marker for chromosomal defects such as rearrangement and translocations. 

CLASP-2 polynucleotides can be also be used as polymorphic markers for 
10 forensic analysis. See generally National Research Council, The Evaluation of Forensic DNA 
Evidence (Eds. 1996, Pollard et al. 9 National Academy Press, Washington D.C). The 
capacity to identify a distinguishing or unique set of forensic markers in an individual is 
useful for forensic analysis. For example, one can determine whether a blood sample from a 
suspect matches a blood or other tissue sample from a crime scene by determining whether 
15 the set of polymorphic forms occupying selected polymorphic sites is the same in the suspect 
and the sample. If the set of polymorphic markers does not match between a suspect and a 
sample, it can be concluded (barring experimental error) that the suspect was not the source 
of the sample. If the set of markers does match, one can conclude that the DNA from the 
suspect is consistent with that found at the crime scene. If frequencies of the polymorphic 
20 forms at the loci tested have been determined (e.g., by analysis of a suitable population of 
individuals), one can perform a statistical analysis to determine the probability that a match 
of suspect and crime scene sample would occur by chance. 

To make such an identification, PCR technology can be usedto amplify DNA 
sequences taken from very small biological samples such as tissues, e.g., hair or skin, or body 
25 fluids, e.g., blood, saliva, or semen found at a crime scene. The amplified sequence can then 
be compared to a standard, thereby allowing identification of the origin of the biological 
sample. The CLASP-2 polynucleotide sequences of the present invention can be used to 
provide polynucleotide reagents, e.g., PCR primers, targeted to specific loci in the human 
genome, which can enhance the reliability of DNA-based forensic identifications by, for 
30 example, providing another "identification marker" (i.e. another DNA sequence that is unique 
to a particular individual). As mentioned above, actual base sequence information can be 
used for identification as an accurate alternative to patterns formed by restriction enzyme 
generated fragments. Sequences targeted to noncoding regions of SEQ ID NO: 1, 3, 5 or 9 are 
particularly appropriate for this use as greater numbers of polymorphisms occur in the 
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noncoding regions, making it easier to differentiate individuals using this technique. 
Examples of polynucleotide reagents include the CLASP-2 nucleotide sequences or portions 
thereof, e.g.. fragments derived from the noncoding regions of SEQ ID NO: 1, 3, 5, or 9 
having a length of at least 20 bases, preferably at least 25 bases, and more preferably at least 
5 30 bases. 

CLASP-2 polynucleotides can also be used as reagents for paternity testing. 
The object of paternity testing is usually to determine whether a male is the father of a child. 
In most cases, the mother of the child is known and thus, the mother's contribution to the 
child's genotype can be traced. Paternity testing investigates whether the part of the child's 
1 0 genotype not attributable to the mother is consistent with that of the putative father. Paternity 
testing can be performed by analyzing sets of polymorphisms in the putative father and the 
child. Of course, the present invention can be expanded to the use of this procedure to 
determine if one individual is related to another. Even more broadly, the present invention 
can be employed to determine how related one individual is to another, for example, between 

15 races or species. 

Bacterial infections are a major cause of health-related problems. However, 

the emergence of drug resistant bacteria is compromising the therapeutic value of the present 
spectrum of antibiotics. All the currently used antibiotics are small organic molecules, with 
certain level of structural similarity. This provides an advantage for bacteria to develop drug 
20 resistance, since they need to modify a limited number of genes in order to become resistant 
to a wide variety of antibiotics. The development of antibiotics with different chemical 
structure and targets can overcome antibiotic resistance, and provide therapeutic superiority 
in preventing infection by bacterial pathogens. Additionally, most antibiotics are not naturally 
occurring compounds and cause minor or sometimes serious side effects. For example, 
25 antibiotics used to treat TB can cause hearing loss. 

The present invention provides new antibacterial agents. Certain CLASP-2 
DNA sequences were difficult to clone and subclone {see Example 1). Bacteria harboring 
certain pieces of CLASP cDNA products were unable to be isolated, indicating that 
introduction of CLASP sequencescompromised bacterial viability. There can be at least two 
30 possible reasons why the CLASP cDNA were unable to be cloned, which can reflect a 
variation of the well-established Modification and Restriction systems found in bacteria 
(reviewed in Wilson and Murray. (1991) Annu. Rev. Genet. 25:585-627; Bickle and Kruger 
(1993) Microbiol. Rev. 57:29-67). This well-described system is used by bacteria to prevent 
deleterious effects caused by the introduction of foreign DNA. Bacteria can recognize 
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foreign DNA since it does not have the same modifications (e.g. methylation)~as the native 
DNA. After recognition, the bacteria then digest and eliminate the foreign DNA (restriction). 
In the firs! scenario, the CLASP cDNA can be recognized as foreign DNA, and digested and 
eliminated as in the Modification and Restriction system. However, this would be unique for 
5 CLASP cDNA since the bacteria used for cloning cDNA are compromised in the 
Modification and Restriction system, which makes cloning of cDNA into bacteria a practice 
common in the art. If this is the case, the bacterial apparatus that specifically recognizes or 
eliminates CLASP cDNA can provide a novel target to develop antimicrobial agents. The 
CLASP DNA sequence would be useful in targeting the apparatus as well as an entry point 

10 for designing screens to identify potential targets. The second possibility is that CLASP 
cDNA behaves as an antimicrobial agent (i.e., antibiotic), and prevents bacterial growth. 
This, in effect, would create a new type of antibiotic mediated by the presence of foreign 
DNA (i.e. CLASP cDNA). In the case for the CLASP cDNA, the bacteria can recognize the 
DNA but instead of digesting and eliminating the DNA, the CLASP cDNA can cause a 

15 variation of the restriction and prevent the bacteria from growing, imposing a bacteriacidal 
effect upon the bacteria. 

DNA as an antimicrobial agent has significant advantages over currently 
available agents. First, it is structurally unrelated to any existing antibiotics, and can 
overcome the present growing drug-resistance problem to structurally common agents. 

20 Second, since DNA antimicrobials composed of naturally-occurring human DNA, are 
expected to have minimal side effects and immune rejection. Third, DNA sequences can be 
tailored with sequence variation and numerous chemical modifications to circumvent the 
problem of resistance. Fourth, the antimicrobial DNA can be delivered specifically to 
bacterial cells through the use of bacteriophages (i.e., bacterial virus) which specifically 

25 infect bacteria and do not infect human cells. Further specificity can be generated to infect 
certain bacteria and bacterial subpopulations. Finally, this system can be economically robust 
since the generation of DNA and delivery vehicles are inexpensive. 

5.5. Polypeptides Encoded by the CLASP-2 Gene Coding Sequence 

In accordance with the invention, a CLASP-2 polynucleotide which encodes 
30 the CLASP-2 polypeptides, mutant polypeptides, peptide fragments, CLASP-2 fusion 
proteins or functional equivalents thereof, can be used to express CLASP-2 proteins in 
appropriate host cells. In various embodiments, the CLASP-2 polypeptides expressed will be 
identical or substantially similar to SEQ ID NOs: 2, 4, 6 or 10 or a fragment thereof. 
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In some embodiments, altered DNA sequences which can be used in 
accordance with the invention include deletions, additions or substitutions of different 
nucleotide residues resulting in a sequence that encodes the same or a functionally equivalent 
gene product. For example, due to the inherent degeneracy of the genetic code, other DNA 
sequences which encode substantially the same or a functionally equivalent amino acid 
sequence, can be used in the practice of the invention for the expression of the CLASP-2 
protein. Because of the degeneracy of the genetic code, a large number of functionally 
identical nucleic acids encode any given protein. For instance, the codoris GCA, GCC, GCG 
and GCU all encode the amino acid alanine. Thus, at every position where an alanine is 
specified by a codon, the codon can be altered to any of the corresponding codons described 
without altering the encoded polypeptide. Such nucleic acid variations are "silent variations,'* 
which are one species of conservatively modified variations. One of skill will recognize that 
each codon in a nucleic acid sequence such SEQ ID NO: 1 (except AUG, which is ordinarily 
the only codon for methionine, and TGG, which is ordinarily the only codon for tryptophan) 
can be modified to yield a functionally identical molecule. Accordingly, each silent variation 
of a nucleic acid which encodes a polypeptide is implicit in each described sequence. Thus, 
for example, due to the degeneracy of the genetic code, a polypeptide having the sequence of 
SEQ ID NO: 2 or a fragment thereof, can be encoded by numerous polynucleotides other than 
SEQ ID NO: 1 . Typically, the degenerate sequence will hybridize with SEQ ID NO: 1 under 
high or moderate stringency conditions, but this is not strictly required (e.g., when a copy of a 
nucleic acid is created using the maximum codon degeneracy permitted by the genetic code. 
In such cased, the nucleic acids typically hybridize under moderately stringent hybridization 
conditions.) 

The gene product itself can contain deletions, additions or substitutions of 
amino acidresidues within a CLASP-2 sequence, which result in a silent change thus 
producing a functionally equivalent CLASP-2 protein. Such conservative amino acid 
substitutions can be made on the basis of similarity in polarity, charge, solubility, 
hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues involved. For 
example, negatively charged amino acids include aspartic acid and glutamic acid; positively 
charged amino acids include lysine, histidine and arginine; amino acids with uncharged polar 
head groups having similar hydrophilicity values include the following: glycine, asparagine, 
glutamine, serine, threonine, tyrosine; and amino acids with nonpolar head groups include 
alanine, valine, isoleucine, leucine, phenylalanine, proline, methionine, tryptophan. 
Creighton, 1984, Proteins, has grouped amino acids that are conservative substitutions for 
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one another as follows: (1) Alanine (A), Glycine (G); (2) Aspartic acid (D), Glutamic acid 
(E); (3) Asparagine (N), Glutamine (Q); (4) Arginine (R), Lysine (K); (5) Isoleucine (I), 
Leucine (L), Methionine (M), Valine (V); (6) Phenylalanine (F), Tyrosine (Y), Tryptophan 
(W); (7) Serine (S), Threonine (T); and (8) Cysteine (C), Methionine (M). 
5 The DNA sequences of the invention can be engineered in order to alter a 

CLASP-2 coding sequence for a variety of ends, including but not limited to, alterations 
which modify processing and expression of the gene product. For example, mutations can be 
introduced using techniques which are well known in the art, e.g.. site-directed mutagenesis, 
to insert new restriction sites, to alter glycosylation patterns, phosphorylation, and the like. 

1 0 Based on the domain organization of the CLASP-2 proteins, a large number of CLASP-2 
mutant polypeptides can be constructed by modifying or rearranging the nucleotide 
sequences that encode the CLASP-2 extracellular, transmembrane and cytoplasmic domains. 

In various embodiments, the present invention provides homologues of the 
CLASP-2 polypeptides which function as either an CLASP-2 agonists or an CLASP-2 

1 5 antagonist. In a preferred embodiment, the CLASP-2 agonists and antagonists stimulate or 
inhibit, respectively, a subset of the biological activities of the naturally occurring form of the 
CLASP-2 polypeptide. Thus, specific biological effects can be elicited by treatment with a 
homologue of limited function. In one embodiment, treatment of a subject with a homologue 
having a subset of the biological activities of the naturally occurring form of the polypeptide 

20 has fewer side effects in a subject relative to treatment with the naturally occurring form of 
the CLASP-2 polypeptide. 

The invention contemplates both full-length CLASP-2 polypeptides and 
fragments, e.g., fragments having a length of at least about 10, often 20, frequently 50 or 100 
residues substantially identical to the exemplified CLASP-2 polypeptide sequences of the 

25 invention. Protein fragments can be "free-standing," or comprised within a larger 

polypeptide of which the fragment forms a part or region, most preferably as a single 
continuous region. Representative examples of polypeptide fragments of the invention, 
include, for example, fragments from about amino acid number 1-20, 2 1-40, 4 1-60, 61-80, 
81-100, 102-120, 121-140, 141-160, 161-180, 181-200, or 201 to the end of the coding 

30 region. Moreover, polypeptide fragments can be about 20, 30, 40, 50, 60, 70, 80, 90, 100, 
• 110, 120, 130, 140, 150, 200 amino acids in length. In this context "about" includes the 
particularly recited ranges, larger or smaller by several (5, 4, 3, 2, or 1) amino acids, at either 
extreme or at both extremes. 
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Preferred polypeptide fragments include the CLASP-2 protein. Further 
preferred polypeptide fragments include the CLASP-2 protein having a continuous series of 
deleted residues from the amino or the carboxy terminus, or both. For example, any number 
of amino acids, ranging from 1-X, can be deleted from the amino terminus of either the 

5 CLASP-2 polypeptide. Furthermore, any combination of the above amino and carboxy 

terminus deletions arc preferred. Similarly, polynucleotide fragments encoding these CLASP- 
2 polypeptide fragments are also preferred. 

Even if deletion of one or more amino acids from the N-terminus of a protein 
results in modification of loss of one or more biological functions of the protein, other 

10 biological activities can still be retained. Thus, the ability of shortened CLASP-2 muteins to 
induce and/or bind to antibodies which recognize the complete or mature forms of the 
polypeptides generally will be retained when less than the majority of the residues of the 
complete or mature polypeptide are removed from the N-terminus. Whether a particular 
polypeptide lacki ng N-terminal residues of a complete polypeptide retains such immunologic 

15 activities can readily be determined by routine methods described herein and otherwise 

known in the art. It is not unlikely that a CLASP-2 mutein with a large number of deleted N- 
terminal amino acid residues can retain some biological or immunogenic activities. In fact, 
peptides composed of as few as four CLASP-2 amino acid residues can often evoke an 
immune response. 

20 Homologues of the CLASP-2 polypeptide can be generated by mutagenesis, 

e.g., discrete point mutation or truncation of the CLASP-2 polypeptide. As used herein, the 
term "homologue" refers to a variant form of the CLASP-2 polypeptide which acts as an 
agonist or antagonist of the activity of the CLASP-2 polypeptide. An agonist of the CLASP-2 
polypeptide can retain substantially the same, or a subset, of the biological activities of the 
25 CLASP-2 polypeptide. An antagonist of the CLASP-2 polypeptide can inhibit one or more of 
the activities of the naturally occurring form of the CLASP-2 polypeptide, by, for example, 
competitively binding to a downstream or upstream member of the CLASP-2 molecular 
pathway which includes the CLASP-2 polypeptide. 

Modulation can be assayed by determining any parameter that is indirectly or 
30 directly affected by the expression of the target gene. Such parameters include, e.g., changes 
in RNA or protein levels, changes in protein activity, changes in product levels, changes in 
downstream gene expression, changes in reporter gene transcription (luciferase, CAT, P- 
galactosidase, p-glucuronidase, GFP (see, e.g., Mistili & Spector, 1997, Nature 
Biotechnology 15: 961-964); changes in signal transduction, phosphorylation and 
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dephosphorylation, receplor-ligand interactions, second messenger concentrations {e.g., 
cGMP, cAMP, IP 3 , and Ca' ), and cell growth. These assays can be in v/7ro, in vivo, and ex 
vivo. Such functional effects can be measured by any means known to those skilled in the 
art, e.g., measurement of RNA or protein levels, measurement of RNA stability, identification 
5 of downstream or reporter gene expression, e.g., via chemiluminescence, fluorescence, 

colorimetric reactions, antibody binding, inducible markers, ligand binding assays; changes 
in intracellular second messengers such as cGMP and inositol triphosphate (IP3); changes in 
intracellular calcium levels; cytokine release, and the like. 

5.5.1. Synthesis or Expression of CLASP-2 Polypeptide Expression Systems 

10 In order to express a biologically active CLASP-2, the nucleotide sequence 

coding for CLASP-2, or a functional equivalent, is inserted into an appropriate expression 
vector. The CLASP-2 gene product as well as host cells or cell lines transfected or 
transformed with recombinant CLASP-2 expression vectors can be used for a variety of 
purposes. These include, but are not limited to, generating antibodies (i.e., monoclonal or 

1 5 polyclonal) that competitively inhibit activity of CLASP-2 protein and neutralize its activity; 
antibodies that activate CLASP-2 function and antibodies that detect its presence on the cell 
surface or in solution. Anti-CLASP-2 antibodies can be used in detecting and quantifying 
expression of CLASP-2 levels in cells and tissues such as lymphocytes and macrophages, as 
well as isolating CLASP-2-positive cells from a cell mixture. 

20 Methods which are well known to those skilled in the art can be used to 

construct recombinant expression vectors containing the CLASP-2 coding sequence and 
appropriate transcriptional/translational control signals. These methods include in vitro 
recombinant DNA techniques, synthetic techniques and in vivo recombination/genetic 
recombination. (See, e.g., the techniques described in Sambrook et ah, 1989, Molecular 

25 Cloning A Laboratory Manual, Cold Spring Harbor Laboratory, N.Y. and Ausubel et aL 9 . 
supra). The recombinant expression vectors of the invention comprise a nucleic acid of the 
invention in a form suitable for expression of the nucleic acid in a host cell, which means that 
the recombinant expression vectors include one or more regulatory sequences, selected on the 
basis of the host cells to be used for expression, which is operatively linked to the nucleic 

30 acid sequence to be expressed. It will be appreciated by those skilled in the art that the 

design of the expression vector can depend on such factors as the choice of the host cell to be 
transformed, the level of expression of polypeptide desired, and the like. The expression 
vectors of the invention can be introduced into host cells to thereby produce polypeptides or 
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peptides, including fusion polypeptides or peptides, encoded by nucleic acids as^described 
herein (e.g.. CLASP-2 polypeptides, mutant forms of CLASP-2, fusion polypeptides, and the 
like). 

A variety of host-expression vector systems can be utilized to express a 
5 CLASP-2 coding sequence. These include, but are not limited to, microorganisms such as 
bacteria transformed with recombinant bacteriophage DNA, plasmid DNA, or cosmid DNA 
expression vectors containing the CLASP-2 coding sequence; yeast transformed with 
recombinant yeast expression vectors containing the CLASP-2 coding sequence; insect cell 
systems infected with recombinant virus expression vectors {e.g., baculovirus) containing the 
1 0 CLASP-2 coding sequence; plant cell systems infected with recombinant virus expression 
vectors {e.g.. cauliflower mosaic virus, CaMV; tobacco mosaic virus, TMV) or transformed 
with recombinant plasmid expression vectors {e.g., Ti plasmid) containing the CLASP-2 
coding sequence; or animal cell systems. The expression elements of these systems vary in 
their strength and specificities. Depending on the host/vector system utilized, any of a 
1 5 number of suitable transcription and translation elements, including constitutive and 

inducible promoters, can be used in the expression vector. For example, when cloning in 
bacterial systems, inducible promoters such as pL of bacteriophage X, plac, ptrp, ptac (ptrp- 
lac hybrid promoter; cytomegalovirus promoter) and the like can be used; when cloning in 
insect cell systems, promoters such as the baculovirus polyhedron promoter can be used; 
20 when cloning in plant cell systems, promoters derived from the genome of plant cells {e.g., 
heat shock promoters; the promoter for the small subunit of RUBISCO; the promoter for the 
chlorophyll oc/p binding protein) or from plant viruses {e.g., the 35 S RNA promoter of CaMV; 
the coat protein promoter of TMV) can be used; when cloning in mammalian cell systems, 
promoters derived from the genome of mammalian cells {e.g., metallothionein promoter) or 
25 from mammalian viruses {e.g., the adenovirus late promoter; the vaccinia virus 7.5K 

promoter) can be used; when generating cell lines that contain multiple copies of the CLASP- 
2 DNA, SV40-, BPV- and EBV-based vectors can be used with an appropriate selectable 
marker. 

In bacterial systems a number of expression vectors can be advantageously 
30 selected depending upon the use intended for the expressed CLASP-2 product. For example, 
when large quantities of CLASP-2 protein are to be produced for the generation of antibodies 
or to screen peptide libraries, vectors which direct the expression of high levels of fusion 
protein products that are readily purified can be desirable. Such vectors include, but are not 
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limited to, the E. coli expression vector pUR278 (Ruther et aL 9 1983, EMBO J. T: 1791), in 
which the CLASP-2 coding sequence can be ligated into the vector in frame with the lacZ 
coding region so that a hybrid protein is produced; pIN vectors (Inouye & Inouye, 1985, 
Nucleic acids Res. 13: 3101-3109; Van Heeke & Schuster, 1989, J. Biol. Chem. 264: 5503- 
5 5509); and the like. pGEX vectors may also be used to express foreign polypeptides as 
fusion proteins with glutathione S-transferase (GST). In general, such fusion proteins are 
soluble and can easily be purified from lysed cells by adsorption to glutathione-agarose beads 
followed by elution in the presence of free glutathione. Proteins made in such systems may be 
designed to include heparin, thrombin, or factor XA protease cleavage sites so that the cloned 

1 0 polypeptide of interest can be released from the GST moiety at will. 

In yeast, a number of vectors containing constitutive or inducible promoters 
can be used. (Current Protocols in Molecular Biology, Vol. 2, 1988 (Suppl. 1999), Ed. 
Ausubel et aL, Greene Publish. Assoc. & Wiley Interscience, Ch. 13; Grant et aL, 1987, 
Expression and Secretion Vectors for Yeast, in Methods in Enzymology, Eds. Wu & 

15 Grossman, 1987, Acad. Press, N.Y., Vol. 153, pp. 516-544; Glover, 1986, DNA Cloning, 
Vol. H, IRJL Press, Wash., DC, Ch. 3; and Bitter, 1987, Heterologous Gene Expression in 
Yeast, Methods in Enzymology, Eds. Berger & Kimmel, Acad. Press, N.Y., Vol. 152, pp. 
673-684; and The Molecular Biology of the Yeast Saccharomyces, 1982, Eds. Strathern et 
aL, Cold Spring Harbor Press, Vols. I and II.) 

20 In cases where plant expression vectors are used, the expression of the 

CLASP-2 coding sequence can be driven by any of a number of promoters. For example, 
viral promoters such as the 35S RNA and 19S RNA promoters of CaMV (Brisson et al, 
1984, Nature 310: 511-514), or the coat protein promoter of TMV (Takamatsu et aL, 1987, 
EMBO J. 6: 307-31 1) can be used; alternatively, plant promoters such as the small subunit of 

25 RUBISCO (Coruzzi et aL, 1984, EMBO J. 3: 1671-1680; Broglie et aL, 1984, Science 224: 
838-843); or heat shock promoters, e.g., soybean hspl7.5-E or hspl7.3-B (Gurley et al, 
1986, Mol. Cell. Biol. 6: 559-565) can be used. These constructs can be introduced into plant 
cells using Ti plasmids, Ri plasmids, plant virus vectors, direct DNA transformation, 
microinjection, electroporation, and the like. (Weissbach & Weissbach, 1988, Methods for 

30 Plant Molecular Biology, Academic Press, NY, Section Vm, pp. 421-463; and Grierson & 
Corey, 1988, Plant Molecular Biology, 2d Ed., Blackie, London, Ch. 7-9.) 

An alternative expression system which could be used to express CLASP-2 is 
an insect system. In one such system, Autographa californica nuclear polyhedrosis virus 
(AcNPV) is used as a vector to express foreign genes. The virus grows in Spodoptera 
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frugiperda cells. The CLASP-2 coding sequence can be cloned into non-essential regions 
{e.g., the polyhedron gene) of the virus and placed under control of an AcNPV promoter 
(e.g., the polyhedron promoter). Successful insertion of the CLASP-2 coding sequence will 
result in inactivation of the polyhedron gene and production of non-occluded recombinant 
virus (i.e., virus lacking the proteinaceous coat coded for by the polyhedron gene). These 
recombinant viruses are then used to infect Spodoptera frugiperda cells in which the inserted 
gene is expressed, {see, e.g., Smith et al, 1983, J. VioL 46: 584; Smith, U.S. Patent No. 
4,215,051). 

In mammalian host cells, a number of viral based expression systems can be 
utilized. In cases where an adenovirus is used as an expression vector, the CLASP-2 coding 
sequence can be ligated to an adenovims transcription/translation control complex, e.g., the 
late promoter and tripartite leader sequence. This chimeric gene can then be inserted in the 
adenovirus genome by in vitro or in vivo recombination. Insertion in a non-essential region 
of the viral genome (e.g., region El or E3) will result in a recombinant virus that is viable and 
capable of expressing CLASP-2 in infected hosts. (See, e.g., Logan & Shenk, 1984, Proc. 
Natl. Acad. Sci. U.S.A. 81: 3655-3659). Alternatively, the vaccinia 7.5K promoter can be 
used. (See, e.g., Mackett et al, 1982, Proc. Natl. Acad. Sci. U.S.A. 79: 7415-7419; Mackett 
et al, 1984, J. Virol. 49: 857-864; Panicali et al, 1982, Proc. Natl. Acad. Sci. U.S.A. 79: 
4927-4931). Regulatable expression vectors such as the tetracycline repressible vectors can 
also be used to express a coding sequence in a controlled fashion. 

Specific initiation signals can also be required for efficient translation of 
inserted CLASP-2 coding sequences. These signals include the ATG initiation codon and 
adjacent sequences. In cases where the entire CLASP-2 gene, including its own initiation 
codon and adjacent sequences, is inserted into the appropriate expression vector, no 
additional translational control signals can be needed. However, in cases where only a 
portion of the CLASP-2 coding sequence is inserted, exogenous translational control signals, 
including the ATG initiation codon, must be provided. Furthermore, the initiation codon 
must be in phase with the reading frame of the CLASP-2 coding sequence to ensure 
translation of the entire insert. These exogenous translational control signals and initiation 
codons can be of a variety of origins, both natural and synthetic. The efficiency of expression 
can be enhanced by the inclusion of appropriate transcription enhancer elements, 
transcription terminators, and the like, (see Bittner et al, 1987, Methods in Enzymol. 153: 
516-544). 
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In addition, a host cell strain can be chosen which modulates the expression of 
the inserted sequences, or modifies and processes the gene product in a specific fashion 
desired. Such modifications (e.g., glycosylation) and processing (e.g., cleavage) of protein 
products can be important for the function of the protein. The presence of several consensus 
5 N-glycosylation sites in CLASP-2 extracellular domains support the possibility that proper 
modification can play a role in CLASP-2 function. Different host cells have characteristic 
and specific mechanisms for the post-translational processing and modification of proteins. 
Appropriate cell lines or host systems can be chosen to ensure the correct modification and 
processing of the foreign protein expressed. To this end, eukaryotic host cells which possess 
10 the cellular machinery for proper processing of the primary transcript, glycosylation, and 

phosphorylation of the gene product can be used. Such mammalian host cells include, but are 
not limited to, CHO, VERO, BHK, HeLa, GGS, MDGK, 293, WI38, and the like. 

Host cells transformed with nucleotide sequences encoding CLASP-2 may be 
cultured under conditions suitable for the expression and recovery of the soluble protein from 
15 cell culture. The protein produced by a transformed cell may be secreted or contained 

intracellularly depending on the sequence and/or the vector used. As will be understood by 
those of skill in the art, expression vectors containing polynucleotides which encode CLASP- 
2 may be designed to contain signal sequences which direct secretion of CLASP-2 through a 
prokaryotic or eukaryotic cell membrane. Other constructions may be used to join sequences 
20 encoding CLASP-2 to nucleotide sequence encoding a polypeptide domain which will 

facilitate purification of soluble proteins. Such purification facilitating domains include, but 
are not limited to, metal chelating peptides such as histidine-tryptophan modules that allow 
purification on immobilized metals, protein A domains that allow purification on 
immobilized immunoglobulin, 
25 _ For long-term, high-yield production of recombinant proteins, stable 

expression is preferred. For example, cell lines which stably express CLASP-2 proteins can 
be engineered. Rather than using expression vectors which contain viral origins of 
replication, host cells can be transformed with the CLASP-2 DNA controlled by appropriate 
expression control elements (e.g. r promoter, enhancer, sequences, transcription terminators, 
30 polyadenylation sites, and the like.), and a selectable marker. Following the introduction of 
foreign DNA, engineered cells can be allowed to grow for 1-2 days in an enriched medium, 
and then switched to a selective medium. The selectable marker in the recombinant plasmid 
confers resistance to the selection and allows cells to stably integrate the plasmid into their 
chromosomes and grow to form foci which in turn can be cloned and expanded into cell lines. 
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This method can advantageously be used to engineer cell lines which express the CLASP-2 
protein(s) on the cell surface. Such engineered cell lines are particularly useful in screening 
for molecules or drugs that affect CLASP-2 function. 

A number of selection systems can be used, including but not limited to, the 
herpes simplex virus thymidine kinase (Wigler et aL, 1977, Cell 11: 223), hypoxanthine- 
guanine phosphoribosyltransferase (Szybalska & Szybalski, 1962, Proc. Natl. Acad. Sci; 
U.S.A. 48: 2026), and adenine phosphoribosyltransferase (Lowy et aL, 1980, Cell 22: 817) 
genes which can be employed in tk", hgprt* or aprt* cells, respectively. Also, antimetabolite 
resistance can be used as the basis of selection for dhfr 9 which confers resistance to 
methotrexate (Wigler et aL, 1980, Natl. Acad. Sci. U.S.A. 77: 3567; O'Hare et aL, 1981, 
Proc. Natl. Acad. Sci. U.S.A. 78: 1527); gpt y which confers resistance to mycophenolic acid 
(Mulligan & Berg, 1981), Proc. Natl. Acad. Sci. U.S.A. 78: 2072); neo, which confers 
resistance to the aminoglycoside G-418 (Colberre-Garapin et aL, 1981, J. Mol. Biol. 150: 1); 
and hygro, which confers resistance to hygromycin (Santerre et aL, 1984, Gene 30: 147). 
Additional selectable genes have been described, namely trpB, which allows cells to utilize 
indole in place of tryptophan; hisD, which allows cells to utilize histinol in place of histidine 
(Hartman & Mulligan, 1988, Proc. Natl. Acad. Sci. U.S.A. 85: 8047); ODC (ornithine 
decarboxylase) which confers resistance to the ornithine decarboxylase inhibitor, 2- 
(difluoromethyl)-DL-ornithine, DFMO (McConlogue L., 1987, In: Current Communications 
in Molecular Biology, Cold Spring Harbor Laboratory ed.) and glutamine synthetase 
(Bebbington et aL, 1992, Biotech 10: 169). 

In an alternate embodiment of the invention, the coding sequence of CLASP-2 
could be synthesized in whole or in part, using chemical methods well known in the art. (See, 
e.g., Caruthers et aL, 1980, Nuc. Acids Res. Symp. Ser. 7: 215-233; Crea and Horn, 180, 
Nuc. Acids Res. 9(10): 2331; Matteucci and Caruthers, 1980, Tetrahedron Letter 21: 719; and 
Chow and Kempe, 1981, Nuc. Acids Res. 9(12): 2807-2817.) Alternatively, the protein itself 
could be produced using chemical methods to synthesize a CLASP-2 amino acid sequence in 
whole or in part. For example, peptides can be synthesized by solid phase techniques, 
cleaved from the resin, and purified by preparative high performance liquid chromatography. 
(See Creighton, 1983, Proteins Structures And Molecular Principles, W.H. Freeman and Co., 
N.Y. pp. 50-60). The composition of the synthetic polypeptides can be confirmed by amino 
acid analysis or sequencing (e.g., the Edman degradation procedure; see Creighton, 1983, 
Proteins, Structures and Molecular Principles, W.H. Freeman and Co., N.Y., pp. 34-49). 
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In some embodiments, the CLASP-2 polypeptide contains non-naturally 
occurring amino acids or amino acid analogs (Le. 9 compounds that have the same basic 
chemical structure as a naturally occurring amino acid, i.e., an a carbon that is bound to a 
hydrogen, a carboxyl group, an amino group, and an R group, e.g. t homoserine, norleucine, 
5 methionine sulfoxide, methionine methyl sulfonium). 

5.5.2. Identification of Cells That Express CLASP-2 

The recombinant host cells which contain the coding sequence and which 
express a CLASP-2 gene product or fragments thereof can be identified by at least four 
general approaches; (a) DNA-DNA or DNA-RNA hybridization; (b) the presence or absence 

10 of "marker" gene functions; (c) assessing the level of transcription as measured by the 
expression of CLASP-2 mRNA transcripts in the host cell; and (d) detection of the gene 
product as measured by immunoassay or by its biological activity. Prior to the identification 
of gene expression, the host cells can be first mutagenized in an effort to increase the level of 
expression of CLASP-2, especially in cell lines that produce low amounts of CLASP-2. 

15 In the first approach, the presence of the CLASP-2 coding sequence inserted 

in the expression vector can be detected by DNA-DNA or DNA-RNA hybridization using 
probes comprising nucleotide sequences that are homologous to the CLASP-2 coding 
sequence, respectively, or portions or derivatives thereof. 

In the second approach, the recombinant expression vector/host system can be 

20 identified and selected based upon the presence or absence of certain "marker" gene 

functions (e.g., thymidine kinase activity, resistance to antibiotics, resistance to methotrexate, 
transformation phenotype, occlusion body formation in baculovirus, and the like). For 
example, if the CLASP-2 coding sequence is inserted within a marker gene sequence of the 
vector, recombinants containing the CLASP-2 coding sequence can be identified by the 

25 absence of the marker gene function. Alternatively, a marker gene can be placed in tandem 
with the CLASP-2 sequence under the control of the same or different promoter used to 
control the expression of the CLASP-2 coding sequence. Expression of the marker in 
response to induction or selection indicates expression of the CLASP-2 coding sequence. 

In the third approach, transcriptional activity for the CLASP-2 coding region 

30 can be assessed by hybridization assays. For example, RNA can be isolated and analyzed by 
Northern blot using a probe homologous to the CLASP-2 coding sequence or particular 
portions thereof. Alternatively, total nucleic acids of the host cell can be extracted and 
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assayed for hybridization to such probes. Additionally, reverse transcription-pqlymerase 
chain reactions can be used to detect low levels of gene expression. 

In the fourth approach, the expression of the CLASP-2 protein product can be 
assessed immunologically, for example by Western blots, immunoassays such as 
5 radioimmuno-precipitation, enzyme-linked immunoassays, fluorescent activated cell sorting 
CTACS"), and the like. This can be achieved by using an anti-CLASP-2 antibody. 
Alternatively, CLASP-2 protein can be expressed as a fusion protein with green-fluorescent 
protein to facilitate its detection in cells (United States Patent Nos. 5,491,084; 5,804,387; 
5,777,079). 

10 Identification of cells or tissues expressing CLASP protein ormRNA, 

especially CLASP-2 isoforms, can be useful for determining normal and abnormal CLASP 
expression in a given cell or tissue. As discussed above, a number of CLASP-2 isoforms 
have been identified, e.g., in Jurkat cells, peripheral blood, and brain. The identification of 
mRNA or protein expression in various cell types and tissues can allow for identification of 
15 isoforms improperly expressed in either a spatial or temporal manner. Expression of 

hCLASP-2D isoform in hematopoietic cells may cause problems due to the presence of the 
SH3 domain not seen in the Jurkat and peripheral blood isoforms. 

Other molecules in the immune system may also interact with portions of 
hCLASP2D. However, the absence of the PBM domain in the hCLASP-2D isoform may be 
20 necessary for function in certain cell types or tissues. Similarly, expression of CLASP 

isoforms 2A, 2B, and 2C in brain may cause problems for different reasons: the PBM present 
in these isoforms may interfere with a particular function by binding any of the known PDZ 
domain protein involved in formation of the neurological synapse. Similarly, the lack of an 
SH3 domain may cause an inappropriate response due to interactions with only a subset of 
25 molecules required for CLASP-2 function in the brain. 

5.5.3. Uses of CLASP-2 Engineered Host Cells 

In one embodiment of the invention, the CLASP-2 protein and/or cell lines 
that express CLASP-2 can be used to screen for antibodies, peptides, small molecules, natural 
and synthetic compounds or other cell bound or soluble molecules that bind to the CLASP-2 
30 protein resulting in stimulation or inhibition of CLASP-2 function. For example, anti- 
CLASP-2 antibodies can be used to inhibit or stimulate CLASP-2 function and to detect its 
presence. Alternatively, screening of peptide libraries with recombrnantly expressed soluble 
CLASP-2 protein or cell lines expressing CLASP-2 protein can be useful for identification of 
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therapeutic molecules that function by inhibiting or stimulating the biological activity of 
CLASP-2. The uses of the CLASP-2 protein and engineered cell lines, described in the 
subsections below, can be employed equally well for homologous CLASP-2 genes in various 
species. 

5 In a specific embodiment of the invention, cell lines may be engineered to 

express the extracellular or intracellular domain of CLASP fused to another molecule such as 
GST. In addition, CLASP, its extracellular domain or its intracellular domain may be fused 
to an immunoglobulin constant region (Hollenbaugh and Aruffo, 1992, Current Protocols in 
Immunology, Unit 1 0 1 9; Aruffo et al, 1990, Cell 61 : 1303) to produce a soluble molecule 

10 with increased half life. The soluble protein or fusion protein can be used in binding assays, 
affinity chromatography, irnmunoprecipitation, Western blot, and the like. Synthetic 
compounds, natural products, and other sources of potentially biologically active materials 
can be screened in assavs that are well known in the art. 

Random peptide libraries consisting of all possible combinations of amino 

15 acids attached to a solid phase support can be used to identify peptides that are able to bind to 
a specific domain of CLASP-2 (Lam, K.S. et al, 1991, Nature 354: 82-84). The screening of 
peptide libraries can have therapeutic value in the discovery of pharmaceutical agents that 
stimulate or inhibit the biological activity of CLASP-2. 

Identification of molecules that are able to bind to the CLASP-2 protein can be 

20 accomplished by screening a peptide library with recombinant soluble CLASP-2 protein. 

Methods for expression and purification of CLASP-2 are described in Section 5.7, supra, and 
can be used to express recombinant full length CLASP-2 or fragments of CLASP-2 
depending on the functional domains of interest. Such domains include CLASP-2 
extracellular domain, transmembrane domain, CLASP-2 intracellular domain, IT AM 

25 containing domain, tyrosine phosphorylation site containing domain, cysteine cluster 
containing domain, cadhcrin motif containing domain, and coil/coil domain. 

To identify and isolate the peptide/solid phase support that interacts and forms 
a complex with CLASP-2, it is necessary to label or "tag" the CLASP-2 molecule. The 
CLASP-2 protein can be conjugated to enzymes such as alkaline phosphatase or horseradish 

30 peroxidase or to other reagents such as fluorescent labels which can include fluorescein 

isothiocyanate (FITC), phycoerythrin (PE) or rhodamine. Conjugation of any given label to 
CLASP-2 can be performed using techniques that are well known in the art. Alternatively, 
CLASP-2 expression vectors can be engineered to express a chimeric CLASP-2 protein 
containing an epitope for which a commercially available antibody exist. The epitope- 
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specific antibody can be tagged with a detectable label using methods well known in the art 
including an enzyme, a fluorescent dye or colored or magnetic beads. 

The "tagged" CLASP-2 conjugate is incubated with the random peptide 
library for 30 minutes to one hour at 22°C to allow complex formation between CLASP-2 
5 and peptide species within the library. The library is then washed to remove any unbound 
protein. If CLASP-2 has been conjugated to alkaline phosphatase or horseradish peroxidase 
the whole library is poured into a petri dish containing substrates for either alkaline 
phosphatase or peroxidase, for example, 5-bromo-4-chloro-3-indoyl phosphate (BCIP) or 
3,3\4,4"-diaminobenzidine (DAB), respectively. After incubating for several minutes, the 
10 peptide/solid phase- CLASP-2 complex changes color, and can be easily identified and 

isolated physically under a dissecting microscope with a micromanipulator. If a fluorescent 
tagged CLASP-2 molecule has been used, complexes can be isolated by fluorescence 
activated sorting. If a chimeric CLASP-2 protein*expressing a heterologous epitope has been 
used, detection of the peptide/CLASP-2 complex can be accomplished by using a labeled 
1 5 epitope-specific antibody. Once isolated, the identity of the peptide attached to the solid 
phase support can be determined by peptide sequencing. 

In addition to using soluble CLASP-2 molecules, in another embodiment, it is 
possible to detect peptides that bind to cell-associated CLASP-2 using intact cells. The use of 
intact cells is preferred for use with cell surface molecules. Methods for generating cell lines 
20 expressing CLASP-2 are described in Section 5.8. The cells used in this technique can be 

either live or fixed cells. The cells can be incubated with the random peptide library and bind 
to certain peptides in the library to form a "r osette" between the target cells and the relevant 
solid phase support/peptide. The rosette can thereafter be isolated by differential 
centrifugation or removed physically under a dissecting microscope. Techniques for 
25 screening combinatorial libraries are known in the art (Gallop et al, 1994, J. Med. Chem., 37: 
1233; Gordon, 1994, J. Med. Chem., 37: 1385). 

As an alternative to whole cell assays for membrane bound receptors or 
receptors that require the lipid domain of the cell membrane to be functional, CLASP-2 
molecules can be reconstituted into liposomes where label or 'tag" can be attached. 

30 5.5.4. CLASP-2 Fusion Proteins 

In another embodiment of the invention, a CLASP-2 or a modified CLASP-2 
sequence can be ligated to a heterologous sequence to encode a fusion protein. For example, 
for screening of peptide libraries for molecules that bind CLASP-2, it can be useful to 
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produce a chimeric CLASP-2 protein expressing a heterologous epitope that is recognized by 
a commercially available antibody. A fusion protein can also be engineered to contain a 
cleavage site located between a CLASP-2 sequence and the heterologous protein sequence, 
so that the CLASP-2 can be cleaved away from the heterologous moiety. In one embodiment, 
fusion proteins of the invention can contain the CLASP-2 extracellular domain comprising at 
least about residues 1 through 816 or fragment thereof. In another embodiment, fusion 
proteins can contain the CLASP-2 intracellular domain comprising at least about residue 843 
through the end of the CLASP-2 sequence or fragment thereof. 

5.6. Cloning Alleles, Variants, and Species Homologs of CLASP-2 

In order to clone the full length cDNA sequence from any species encoding a 
CLASP-2 cDNA, or to clone variant forms of the molecule, labeled DNA probes made from 
nucleic acid fragments corresponding to any partial cDNA disclosed herein can be used to 
screen a cDNA library derived from lymphoid cells or brain cells. More specifically, 
oligonucleotides corresponding to either the 5' or 3* terminus of the cDNA sequence can be 
used to obtain longer nucleotide sequences. Briefly, the library can be plated out to yield a 
maximum of 30,000 pfu for each 150 mm plate. Approximately 40 plates can be screened. 
The plates are incubated at 37°C until the plaques reach a diameter of 0.25 mm or are just 
beginning to make contact with one another (3-8 hours). Nylon filters are placed onto the 
soft top agarose and after 60 seconds, the filters are peeled off and floated on a DNA 
denaturing solution consisting of 0.4N sodium hydroxide. The filters are then immersed in 
neutralizing solution consisting of 1M Tris-HCl, pH 7.5, before being allowed to air dry. The 
filters are prehybridized in hybridization buffer such as casein buffer containing 10% dextran 
sulfate, 0.5M NaCl, 50mM Tris-HCl, pH 7.5, 0.1% sodium pyrophosphate, 1% casein, 1% 
SDS, and denatured salmon sperm DNA at 0.5 mg/ml for 6 hours at 60°C. The radiolabeled 
probe is then denatured by heating to 95°C for 2 minutes and then added to the 
prehybridization solution containing the filters. The filters are hybridized at 60°C for 16 
hours. The filters are then washed in IX wash mix (10X wash mix contains 3M NaCl, 0.6M 
Tris base, and 0.02M EDTA) twice for 5 minutes each at room temperature, then in IX wash 
mix containing 1% SDS at 60°C for 30 minutes, and finally in 0.3X wash mix containing 
0.1% SDS at 60°C for 30 minutes. The filters are then air dried and exposed to x-ray film for 
autoradiography. After developing, the film is aligned with the filters to select a positive 
plaque. If a single, isolated positive plaque cannot be obtained, the agar plug containing the 
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plaques will be removed and placed in lambda dilution buffer containing 0.1M NaCI, 0.01M 
magnesium sulfate, 0.035M Tris HC1, pH 7.5, 0.01% gelatin. The phage can then be replated 
and rescreencd to obtain single, well isolated positive plaques. Positive plaques can be 
isolated and the cDN A clones sequenced using primers based on the known cDNA sequence. 

5 This step can be repeated until a full length cDNA is obtained. 

It can be necessary to screen multiple cDNA libraries from different tissues to 
obtain a full length cDNA. In the event that it is difficult to identify cDNA clones encoding 
the complete 5' terminal coding region, an often encountered situation in cDNA cloning, the 
RACE (Rapid Amplification of cDNA Ends) technique can be used. RACE is a proven PCR- 

10 based strategy for amplifying the 5* end of incomplete cDNAs. S'-RACE-Ready RNA 
synthesized from human tissues containing a unique anchor sequence is commercially 
available (Clontcch). To obtain the 5' end of the cDNA, PCR is carried out on 5'-RACE- 
Ready cDN A using the provided anchor primer and the 3' primer. A secondary PCR reaction 
is then carried out using the anchored primer and a nested 3' primer according to the 

15 manufacturer's instructions. Once obtained, the fixll length cDNA sequence can be translated 
into amino acid sequence and examined for certain landmarks such as a continuous open 
reading frame flanked by translation initiation and termination sites, a cadherin-like domain, 
an ITAM domain, a tyrosine phosphorylation site, a cysteine cluster, a transmembrane 
domain, and finally overall structural similarity to the CLASP-2 genes disclosed herein. See, 

20 Ponassi et al, 1999, Mech. Dev. 80: 207-212; Isakov, 1998, Receptor Channels 5: 243-253; 
Borroto et aU 1997, Biopolymers 42: 75-88; Dimitratos et al t 1997, Mech. Dev. 63: 127- 
130; Apperson et aL, 1996, J. Neurosci. 16: 6839-6852; Ozawa et aL, 1990, Mech. Dev. 33: 
49-56, which discuss protein domains and are incorporated herein by reference. 

5.7. Modulating Expression of Endogenous CLASP-2 Genes 

25 Alternatively, the expression characteristics of an endogenous CLASP-2 gene 

within a cell population can be modified by inserting a heterologous DNA regulatory element 
into the genome of the cell line such that the inserted regulatory element is operatively linked 
with the endogenous CLASP-2 gene. For example, an endogenous CLASP-2 gene which is 
normally 'transcriptionally silent", i.e., an CLASP-2 gene which is normally not expressed, or 

30 is expressed only at very low levels in a cell population, can be activated by inserting a 
regulatory element which is capable of promoting the expression of a normally expressed 
gene product in the cells. Alternatively, a transcriptionally silent, endogenous CLASP-2 gene 
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can be activated by insertion of a promiscuous regulatory element that works across cell 
types. . 

A heterologous regulatory element can be inserted into a cell line population, 
such that it is operatively linked with an endogenous CLASP-2 gene, using techniques, such 
5 as targeted homologous recombination, which are well known to those of skill in the art, (see 
e.g.. in Chappel, U.S. Patent No. 5,272,071 ; PCT publication No. WO 91/06667, published 
Jan 16, 1991). 

5.8. Anti-CLASP-2 Antibodies 

Various procedures known in the art can be used for the production of 

10 antibodies to epitopes of the natural and recombinantly produced CLASP-2 protein. Such 
antibodies include, but are not limited to, polyclonal, monoclonal, chimeric, single chain, 
human or humanized, IgG, IgM, IgA, IgD or IgE, a complementarity determining region, Fab 
fragments, F(ab') 2 and fragments produced by an Fab expression library as well as anti- 
idiotype antibodies. Antibodies which compete for CLASP-2 binding are especially 

1 5 preferred for diagnostics and therapeutics. 

Monoclonal antibodies that bind CLASP-2 can be radioactively labeled 
allowing one to follow their location and distribution in the body after injection. 
Radioisotope tagged antibodies can be used as a non-invasive diagnostic tool for imaging de 
novo lymphoid tumors and metastases that express CLASP-2. 

20 Immunotoxins can also be designed which target cytotoxic agents to specific 

sites in the body. For example, high affinity CLASP-2 specific monoclonal antibodies can be 
covalently complexed to bacterial or plant toxins, such as diphtheria toxin or ricin. A general 
method of preparation of antibody/hybrid molecules can involve use of thiol-crosslinking 
reagents such as SPDP, which attack the primary amino groups on the antibody and by 

25 disulfide exchange, attach the toxin to the antibody. The hybrid antibodies can be used to 
specifically eliminate CLASP-2 expressing lymphocytes. 

For the production of antibodies, various host animals can be immunized by 
injection with the recombinant or naturally purified CLASP-2 protein, fusion protein or 
peptides, including but not limited to goats, rabbits, mice, rats, hamsters, and the like 

30 Various adjuvants can be used to increase the immunological response, depending on the host 
species, including but not limited to Freund's (complete and incomplete), mineral gels such 
as aluminum hydroxide, surface active substances such as lysolecithin, pluronic polyols, 
polyanions, peptides, oil emulsions, keyhole limpet hemocyanin, dinitrophenol, and poten- 
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tially useful human adjuvants such as BCG (bacilli Calmette-Guerin) and Corynebacterium 
parvum. 

Monoclonal antibodies to CLASP-2 can be prepared by using any technique 
which provides for the production of antibody molecules by continuous cell lines in culture. 
These include, but are not limited to, the hybridoma technique originally described by Kohler 
and Milstein, {Nature, 1975, 256: 495-497), the human B-cell hybridoma technique (Kosbor 
et al, 1983, Immunology Today, 4: 72; Cote et al, 1983, Proc. Natl. Acad. Sci. U.S .A., 80: 
2026-2030) and the EBV-hybridoma technique (Cole et al, 1985, Monoclonal Antibodies 
and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). In addition, techniques developed for the 
production of "chimeric antibodies" (Morrison et al, 1984, Proc. Natl. Acad. Sci. U.S.A., 81: 
6851-6855; Neuberger et al, 1984, Nature, 312: 604-608; Takeda et al, 1985, Nature, 314: 
452-454) by splicing the genes from a mouse antibody molecule of appropriate antigen 
specificity together with genes from a human antibody molecule of appropriate biological 
activity can be used. Alternatively, techniques described for the production of single chain 
antibodies (U.S. Patent 4,946,778) can be adapted to produce CLASP-2 -specific single chain 
antibodies. In some embodiments, phage display technology is used to identify antibodies 
and heteromeric Fab fragments that specifically bind to selected antigens (see, e.g., 
McCafferty et al, Nature 348: 552-554 (1990); Marks et al, Biotechnology 10: 779-783 
(1992)). 

Hybridomas can be screened using enzyme-linked immunosorbent assays 
(ELIS A) in order to detect cultures secreting antibodies specific for refolded recombinant 
CLASP-2. Cultures can also be screened by ELIS A to identify those cultures secreting 
antibodies specific for mammalian-produced CLASP-2. Confirmation of antibody specificity 
can be obtained by western blot using the same antigens. Subsequent ELIS A testing can use 
recombinant CLASP-2 fragments to identify the specific portion of the CLASP-2 molecule 
with which a monoclonal antibody binds. Additional testing can be used to identify 
monoclonal antibodies with desired functional characteristics such as staining of histological 
sections, immunoprecipitation of CLASP-2, inhibition of CLASP-2 binding or stimulation of 
CLASP-2 to transmit an intracellular signal. Determination of the monoclonal antibody 
isotype can be accomplished by ELISA, thus providing additional information concerning 
purification or function. 

Some anti-CLASP-2 monoclonal antibodies of the present invention are 
humanized, human or chimeric, in order to reduce their potential antigenicity, without 
reducing their affinity for their target. Humanized antibodies have been described in the art. 
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See, e.g., Queen, et ah, 1989, Proc. Natl Acad. Sci. U.S.A. 86: 10029; U.S. PatenTNos. 
5,563,762; 5,693,761; 5,585,089 and 5,530,101. The human antibody sequences used for 
humanization can be the sequences of naturally occurring human antibodies or can be 
consensus sequences of several human antibodies. See Kettleborough et al, 1991, Protein 
5 Engineering 4: 773; Kolbinger et al. 9 1993, Protein Engineering 6: 971. Humanized 
monoclonal antibodies against CLASP-2 peptides can also be produced using transgenic 
animals having elements of a human immune system (see, e.g., U.S. Patent Nos. 5,569,825; 
5,545,806; 5,693,762; 5,693,761; and 5,7124,350). 

In some embodiments, an anti-CLASP-2 polypeptide monoclonal or 

10 polyclonal antiserum is produced that is specifically immunoreactive with a particular 

CLASP-2 polypeptide and is selected to have low cross-reactivity against other molecules 
{e.g., other CLASP polypeptides) and any such cross-reactivity is removed by 
immunoabsorbtion prior to use in the immunoassay. Methods for screening and 
characterizing monoclonal antibodies for specificity are well known in the art and are 

15 described generally in Harlow and Lane, supra. For example, polyclonal antibodies raised to 
hCLASP-2A, as shown in SEQ ID NO: 1, or splice variants, or immunogenic portions 
thereof, can be selected to obtain only those polyclonal or monoclonal antibodies that are 
specifically immunoreactive with the target protein not with other proteins. This selection 
may be achieved by subtracting out antibodies that cross-react with molecules. A variety of 

20 immunoassay formats may be used to select antibodies specifically immunoreactive with a 
particular protein. For example, solid-phase ELISA immunoassays are routinely used to 
select antibodies specifically immunoreactive with a protein (see, e.g., Harlow & Lane, 
Antibodies, A Laboratory Manual (1988) for a description of immunoassay formats and 
conditions that can be used to determine specific immunoreactivity). Typically a specific or 

25 selective reaction will be at least twice background signal or noise and more typically more 
than 10 to 100 times background. Alternatively, antibodies that cross-react with a selected set 
of polypeptides may be prepared. 

Antibody fragments which contain specific binding sites of V can be 
generated by known techniques. For example, such fragments include, but are not limited to, 
30 the F(ab')2 fragments which can be produced by pepsin digestion of the antibody molecule 
and the Fab fragments which can be generated by reducing the disulfide bridges of the 
F(ab')2 fragments. Alternatively, Fab expression libraries can be constructed (Huse et al, 
1989, Science, 246: 1275-1281) to allow rapid and easy identification of monoclonal Fab 
fragments with the desired specificity to CLASP-2. 
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Anti-CLASP-2 antibodies can also be used to identify, isolate, inhibit or 
eliminate CLASP-2-expressing cells. In one embodiment, the present invention includes a 
method of identifying an abnormal T cell profile of an immunocompromised subject relative 
to the T cell profile of a non-immunocompromised subject. The method includes (i) sorting a 
sample of peripheral blood mononuclear cells (PBMC) isolated from the 
immunocompromised subject into sets of T cell types, (ii) determining the ratio of CLASP-2* 
cells relative to the total number of cells (CLASP-2 + : total) in each set, and identifying an 
abnormal T cell profile in the immunocompromised subject by comparing the CLASP-2 + : 
total ratios of sets from the immunocompromised subject with the CLASP-2 + : total ratios of 
analogous sets from a non-immunocompromised subject. 

In other embodiments, anti-CLASP-2 antibodies can be used for detection of 
hCLASP-2 protein in assays such as fluorescent activated cell sorting (FACS), ELISA, 
fluorescent or electron immunomicroscopy, Western blots, gel shift analyses. CLASP-2 
expression in various cells, localization within cells, interactions with other proteins, and 
differentiation between CLASP-2 isoform expression can be determined by use of the 
techniques listed herein. 

5.9. Screening Assays 

The invention provides methods for identifying compounds or agents that 
modulate {i.e., inhibit or enhance) CLASP-2 expression or activity. CLASP-2 expression or 
activity modulators are useful for treatment of disorders characterized by (or associated with) 
aberrant or abnormal CLASP-2 expression or activity. Aberrant expression of CLASP-2 
mRNA or protein means expression in lymphocytes (e.g., T lymphocytes or B lymphocytes) 
or other CLASP-2 expressing cells of at least 2-fold, preferably at least 5-fold greater than 
expression in control lymphocytes obtained from a healthy subject. 

The CLASP-2 expression assays can include the steps of contacting a cell 
expressing CLASP-2 with a compound or agent and assaying CLASP-2 expression. CLASP- 
2 polypeptide expression is easily measured by ELISA using anti-CLASP-2 antibodies of the 
invention. CLASP-2 mRNA expression (including expression of specific species or splice 
variants of CLASP-2) can be measured by quantitative Northern analysis or quantitative 
PCR. 

CLASP-2 activities include, for example, the CLASP-2 polypeptide binding to 
PDZ-domain containing molecules and CLASP-2 polypeptide involvement in signal 
transduction (e.g., leading to T cell activation). Compounds or agents that modulate the 
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interaction of a CLASP-2 polypeptide and a target molecule, modulate CLASP-2 nucleic acid 
expression, or modulate CLASP-2 polypeptide activity are all contemplated by the methods 
of the present invention. 

Test compounds include, for example, 1) peptides (e.g., soluble peptides, 
5 including Ig-tailed fusion peptides and members of random peptide libraries (see, e.g., Lam, 
K. S. et al, 1991, Nature 354: 82-84; Houghten, R. et al, 1991, Nature 354: 84-86) and 
combinatorial chemistry-derived molecular libraries made of D- and/or L-configuration 
amino acids; 2) phosphopeptides (e.g., members of random and partially degenerate, directed 
phosphopeptide libraries, see, e.g., Songyang, Z. et al, 1993, Cell 72: 767-778); 3) CLASP-2 

10 antibodies (as described above); 4) small organic and inorganic molecules (e.g., molecules 
obtained from combinatorial and natural product libraries); 5) antisense RNA and DNA 
molecules arid ribozymes (described above). 

The CLASP modulators can be any of a large variety of compounds, both 
naturally occurring and synthetic, organic and inorganic, and including polymers (e.g., 

15 oligopeptides, polypeptides, oligonucleotides, and polynucleotides), small molecules, 
antibodies, sugars, fatty acids, nucleotides and nucleotide analogs, analogs of naturally 
occurring structures (e.g., peptide mimetics, nucleic acid analogs, and the like), and numerous 
other compounds. 

In one embodiment, the invention provides assays for screening test 

20 compounds which bind to CLASP-2 polypeptides. The assays can be recombinant cell based 
or cell-free assays. These assays can include the steps of combining a cell expressing a 
CLASP-2 polypeptide or a binding fragment thereof, and a compound or agent under 
conditions which allow binding of the compound or agent to the CLASP-2 polypeptide to 
form a complex. Complex formation can then be determined. The ability of the candidate 

25 compound or agent to bind to the CLASP-2 polypeptide or fragment thereof is indicated by 
the presence of the candidate compound in the complex. Formation of complexes between the 
CLASP-2 polypeptide and the candidate compound can be quantitated, for example, using 
standard immunoassays. 

In another embodiment, the invention provides screening assays to identify 

30 test compounds which modulate the interaction (and most likely CLASP-2 activity as well) 
between a CLASP-2 polypeptide and a molecule (target molecule with which the CLASP-2 
polypeptide normally interacts. 

In one embodiment, these CLASP-2 target molecules can be tyrosine kinases 
(e.g., lyn, lck, fyn, ZAP-70m SyK, and CSK). In another embodiment, these CLASP-2 target 
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molecules can be tyrosine phosphatases {e.g., EZRIN, SHP-1, SHP-2 and PTP36). In another 
embodiment, these CLASP-2 target molecules can be adaptor proteins {e.g., NCK, CBL, 
SHC, LNK, SLP-76, HS1, SIT, VAV, GrB2, and BRDG1). In another embodiment, these 
CLASP-2 target molecules can be cytoskeletal associated proteins such as ankyrin, spectrin, 
5 talin, ezrin, tropomyosin, myosin, plectin, syndecans, paralemmin, Band 3 protein, 

cytoskeletal protein 4.1 , and PTP36. In a further embodiment, CLASP-2 target molecules 
can be members of the integrin family. 

Typically, the assays are recombinant cell based or cell-free assay. These 
assays can include the steps of combining a cell expressing a CLASP-2 polypeptide or a 
10 binding fragment thereof, a CLASP-2 target molecule {e.g., a CLASP-2 ligand) and a test 
compound, under conditions where but for the presence of the candidate compound, the 
CLASP-2 polypeptide or biologically active portion thereof binds to the target molecule. 
Detecting complex formation between the CLASP-2 polypeptide or the binding fragment 
thereof, the CLASP-2 target molecule and a test compound detecting the formation of a 
1 5 complex which includes the CLASP-2 polypeptide and the target molecule can be 

accomplished. Detection of complex formation can include direct quantitation of the 
complex by, for example, measuring inductive effects, such as T cell activation, of the 
CLASP-2 polypeptide. A significant change, such as a decrease, in the interaction of the 
CLASP-2 and target molecule {e.g., in the formation of a complex between the CLASP-2 and 
20 the target molecule) in the presence of a candidate compound (relative to what is detected in 
the absence of the candidate compound) is indicative of a modulation of the interaction 
between the CLASP-2 polypeptide and the target molecule. Modulation of the formation of 
complexes between the CLASP-2 polypeptide and the target molecule can be quantitated 
using, for example, an immunoassay. To perform cell free drug screening assays, it is 
25 desirable to immobilize either CLASP-2 or its target molecule to facilitate separation of 
complexes from uncomplexed forms of one or both of the polypeptides, as well as to 
accommodate automation of the assay. CLASP-2 binding to a target molecule, in the 
presence and absence of a candidate compound, can be accomplished in any vessel suitable . 
for containing the reactants. Examples of such vessels include microtitre plates, test tubes, 
3 0 and microcentrifuge tubes . 

In one embodiment, a fusion polypeptide can be provided which adds a 
domain that allows the polypeptide to be bound to a matrix. Alternatively, the complexes can 
be dissociated from the matrix, separated by SDS-P AGE, and the level of CLASP-2-binding 

90 

3NSDOCID:<WO 0231117A2 I > 


WO 02/31117 


PCT/US01/32202 


polypeptide found in the bead fraction quantitated from the gel using standard electrophoretic 
techniques. 

Other techniques for immobilizing polypeptides on matrices can also be used 
in the drug screening assays of the invention. For example, either CLASP-2 or its target 
molecule can be immobilized utilizing conjugation of biotin and streptavidin. Biotinylated 
CLASP-2 molecules can be prepared from biotin-NHS (N-hydroxy-succinimide) using 
techniques well known in the art (e.g., biotinylation kit, Pierce Chemicals, Rockford, 111.), 
and immobilized in the wells of streptavidin-coated 96 well plates (Pierce Chemical). 
Alternatively, antibodies reactive with CLASP-2 but which do not interfere with binding of 
the polypeptide to its target molecule can be derivatized to the wells of the plate, and 
CLASP-2 trapped in the wells by antibody conjugation. As described above, preparations of a 
CLASP-2 -binding polypeptide and a candidate compound ate incubated in the CLASP-2 - 
presenting wells of the plate, and the amount of complex trapped in the well can be 
quantitated. Methods for detecting such complexes include immunodetection of complexes 
using antibodies reactive with the CLASP-2 target molecule, or which are reactive with 
CLASP-2 polypeptide and compete with the target molecule; as well as enzyme-linked assays 
which rely on detecting an enzymatic activity associated with the target molecule. 

One method of drug screening utilizes eukaryotic or prokaryotic host cells 
which are stably transformed with recombinant DNA molecules expressing the CLASP-2, 
e.g., the protein having the sequence of SEQ ID NO: 2. Such cells, either in viable or fixed _ 
form, can be used for standard ligand/receptor binding assays (see, e.g., Parce et al. (1989) 
Science 246: 243-247; and Owicki et al (1990) Proc. Natl Acad. Sci. U.S.A. 87: 4007-4011, 
which describe sensitive methods to detect cellular responses. A test compound, often 
labeled, can be assayed for binding or for competition with another ligand for binding. 
Viable cells could also be used to screen for the effects of drugs on CLASP-2 mediated 
functions, e.g., T cell activation, second messenger levels, and others). 

In another embodiment, the invention provides a method for identifying a 
compound (e.g., a screening assay) capable of use in the treatment of a disorder characterized 
by (or associated with) aberrant or abnormal CLASP-2 nucleic acid expression or CLASP-2 
polypeptide activity. This method typically includes the step of assaying the ability of the 
compound or agent to modulate the expression of the CLASP-2 nucleic acid or the activity of 
the CLASP-2 polypeptide thereby identifying a compound for treating a disorder 
characterized by aberrant or abnormal CLASP-2 nucleic acid expression or CLASP-2 
polypeptide activity. 
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Methods for assaying the ability of the compound or agent to modulate the 
expression of the CLASP-2 nucleic acid or activity of the CLASP-2 polypeptide are typically 
cell-based assays. For example, cells which are sensitive to ligands which transduce signals 
via a pathway involving CLASP-2 can be induced to overexpress a CLASP-2 polypeptide in 
the presence and absence of a candidate compound. Candidate compounds which produce a 
change in CLASP-2-dependent responses can be identified. In one embodiment, expression 
of the CLASP-2 nucleic acid or activity of a CLASP-2 polypeptide is modulated in cells and 
the effects of candidate compounds on the readout of interest (such as T cell activation) are 
measured. For example, the expression of genes which are up- or down-regulated in response 
to a CLASP-2-dependent signal cascade can be assayed. 

Alternatively, modulators of CLASP-2 expression can be identified in a 
method where a cell is contacted with a candidate compound and the expression of CLASP-2 
mRNA or polypeptide in the cell is determined. The level of expression of CLASP-2 mRNA 
or polypeptide in the presence of the candidate compound is compared to the level of 
expression of CLASP-2 mRNA or polypeptide in the absence of the candidate compound. 
The candidate compound can then be identified as a modulator of CLASP-2 nucleic acid 
expression based on this comparison. For example, when expression of CLASP-2 mRNA or 
polypeptide is greater in the presence of the candidate compound than in its absence, the 
candidate compound is identified as a stimulator of CLASP-2 nucleic acid expression. 
Alternatively, when CLASP-2 nucleic acid expression is less in the presence of the candidate 
compound than in its absence, the candidate compound is identified as an inhibitor of 
CLASP-2 nucleic acid expression. The level of CLASP-2 nucleic acid expression in the cells 
can be determined by methods described herein for detecting CLASP-2 mRNA or ' 
polypeptide. 

Modulators of CLASP-2 polypeptide activity and CLASP-2 nucleic acid 
expression identified according to these drug screening assays can be used to treat, for 
example, immune disorders. These methods of treatment include the steps of administering 
the modulators of CLASP-2 polypeptide activity or nucleic acid expression, e.g., in a 
pharmaceutical composition as described in §5.10.1 below, to a subject in need of such 
treatment, e.g., a subject with a disorder described herein. 

5.10. Therapeutic Administration of CLASP-2 Modulators 

The CLASP-2 protein is expressed in lymphocytes and, as noted supra, play a 
role in regulating T cell and B cell interactions, thus making CLASP-2 activity {e.g., CLASP- 
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2 binding of regulatory proteins) a target for diagnostic and treatment of immune disorders 
and for modulation of immune function (e.g., T cell activation). Additionally, since CLASP-2 
contains domains capable of transducing an intracellular signal, cell surface CLASP-2 can be 
triggered by an anti- CLASP-2 antibody or soluble CLASP-2 or a fragment thereof in order 
5 to enhance the activation state of a lymphocyte. 

5.10.1. Formiiiation and Route of Administration 

A CLASP-2 polypeptide, a fragment thereof, anti-CLASP-2 antibody, 
CLASP-2 polynucleotide (e.g., antisense or ribozyme), or small molecule agonists or 
antagonists can be administered to a subject perse or in the form of a pharmaceutical or 

10 therapeutic composition. Pharmaceutical compositions comprising the proteins of the 

invention can be manufactured by means -of -conventional -mixing, dissolving, granulating, 
dragee-making, levigating, emulsifying, encapsulating, entrapping or lyophilizing processes. 
Pharmaceutical compositions can be formulated in conventional manner using one or more 
physiologically acceptable carriers, diluents, excipients or auxiliaries which facilitate 

15 processing of the protein or active peptides into preparations which can be used 

pharmaceutical^'. Proper formulation is dependent upon the route of administration chosen. 

Currently, there are three major classes of protein-derived cell-penetrating 
peptides that have been used for delivering of proteins into cells and animals (Lindgren, M.; 
et aL, 2000, Trends Pharmacol Sci. 21: 99-103). In one embodiment, the CLASP-2 protein 

20 or fragment (encoding a functional domain of CLASP-2) can be introduced into the cell as a 
fusion protein tied to a transporter protein derived from homeoprotein transcription factors 
such as ANTP. In another embodiment, the CLASP-2 protein or fragment (encoding a 
functional domain of CLASP-2) can be introduced into the cell as a fusion protein tied to 
other transcription factors such as the HIV Tat protein and the herpes simplex vims type 1 

25 (HSV-1) VP22 protein. Members in this family have been widely used in different cellular 
and animal systems (Schwarze, S.; et aL; 2000, Trends Pharmacol Sci. 21 : 45-48). In another 
embodiment, the CLASP-2 protein or fragment (encoding a functional domain of CLASP-2) 
can be introduced into the cell as a fusion protein tied to peptides derived from signal- 
sequences present in several proteins such as HIV-1 gp41. In other embodiments, there are 

30 several synthetic and/or chemeric cell-penetrating peptides such as transportan and 

Amphiphiloc model peptide (Lindgren, M.; et aL, 2000, Trends Pharmacol Sci. 21: 99-103) 
that can be used. In another embodiment, the CLASP-2 protein or fragment can be 
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introduced by using anti-DNA antibodies (see, e.g., Zack, D. J., et al. 9 1996, J. Immunol. 
157:2082-8 

For topical administration the proteins of the invention can be formulated as 
solutions, gels, ointments, creams, suspensions, and the like, as are well-known in the art. 
5 Systemic formulations include those designed for administration by injection, 

e.g., subcutaneous, intravenous, intramuscular, intrathecal or intraperitoneal injection, as well 
as those designed for transdermal, transmucosal, oral or pulmonary administration. 

For injection, the proteins of the invention can be formulated in aqueous 
solutions, preferably in physiologically compatible buffers such as Hanks's solution, Ringer's 
10 solution, or physiological saline buffer. The solution can contain formulatory agents such as 
suspending, stabilizing and/or dispersing agents. Alternatively, the proteins can be in powder 
form for constitution with a suitable vehicle, e.g., sterile pyrogen-free water, before use. 

For transmucosal administration, penetrants appropriate to the barrier to be 
permeated are used in the formulation. Such penetrants are generally known in the art. 
1 5 For oral administration, a composition can be readily formulated by 

combining the proteins with pharmaceutically acceptable carriers well known in the art. Such 
carriers enable the proteins to be formulated as tablets, pills, dragees, capsules, liquids, gels, 
syrups, slurries, suspensions and the like, for oral ingestion by a patient to be treated. For oral 
solid formulations such as, for example, powders, capsules and tablets, suitable excipients 
20 include fillers such as sugars, such as lactose, sucrose, mannitol and sorbitol; cellulose 
preparations such as maize starch, wheat starch, rice starch, potato starch, gelatin, gum 
tragacanth, methyl cellulose, hydroxypropyhnethyl-cellulose, sodium 

carboxymethylcellulose, and/or polyvinylpyrrolidone (PVP); granulating agents; and binding 
agents. If desired, disintegrating agents can be added, such as the cross-linked 
25 polyvinylpyrrolidone, agar, or alginic acid or a salt thereof such as sodium alginate. 

If desired, solid dosage forms can be sugar-coated or enteric-coated using 

standard techniques. 

For oral liquid preparations such as, for example, suspensions, elixirs and 
solutions, suitable carriers, excipients or diluents include water; glycols, oils, alcohols, and 
30 the like. Additionally, flavoring agents, preservatives, coloring agents and the like can be 
added. 

For buccal administration, the proteins can take the form of tablets, lozenges, 
and the like, formulated in conventional manner. 
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For administration by inhalation, the proteins for use according to the present 
invention are conveniently delivered in the form of an aerosol spray from pressurized packs 
or a nebulizer, with the use of a suitable propellant, e.g., dichlorodifluoromethane, 
trichlorofluoromethane, dichlorotetrafluoroethane, carbon dioxide or other suitable gas. In 
the case of a pressurized aerosol the dosage unit can be determined by providing a valve to 
deliver a metered amount. Capsules and cartridges of, e.g., gelatin for use in an inhaler or 
insufflator can be formulated containing a powder mix of the compound and a suitable 
powder base such as lactose or starch. * 

The proteins can also be formulated in rectal or vaginal compositions such as 
suppositories or retention enemas, e.g., containing conventional suppository bases such as 
cocoa butter or other glycerides. 

In addition to the formulations described previously, the proteins can also be 
formulated as a depot preparation. Such long acting formulations can be administered by 
implantation (for example subcutaneously or intramuscularly) or by intramuscular injection. 
Thus, for example, the proteins can be formulated with suitable polymeric or hydrophobic 
materials (for example as an emulsion in an acceptable oil) or ion exchange resins, or as 
sparingly soluble derivatives, for example, as a sparingly soluble salt. 

Alternatively, other pharmaceutical delivery systems can be employed. 
Liposomes and emulsions are well known examples of delivery vehicles that can be used to 
deliver the proteins or peptides of the invention. Certain organic solvents such as 
dimethylsulfoxide also can be employed, although usually at the cost of greater toxicity. 
Additionally, the proteins can be delivered using a sustained-release system, such as 
semipermeable matrices of solid polymers containing the therapeutic agent. Various 
sustained-release materials have been established and are well known by those skilled in the 
art. Sustained-release capsules can, depending on their chemical nature, release the proteins 
for a few weeks up to over 100 days. Depending on the chemical nature and the biological 
stability of the therapeutic reagent, additional strategies for protein stabilization can be 
employed. 

As the proteins and peptides of the invention can contain charged side chains 
or termini, they can be included in any of the above-described formulations as the free acids 
or bases or as pharmaceutically acceptable salts. Pharmaceutically acceptable salts are those 
salts which substantially retain the biologic activity of the free bases and which are prepared 
by reaction with inorganic acids. Pharmaceutical salts tend to be more soluble in aqueous 
and other protic solvents than are the corresponding free base forms. 
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5,10.2. Effective Dosages 

CLASP-2 polypeptides, CLASP-2 fragments and anti-CLASP-2 antibodies 
will generally be used in an amount effective to achieve the intended purpose. For use to 
inhibit an immune response, the proteins of the invention, or pharmaceutical compositions 
5 thereof, are administered or applied in a therapeutically effective amount. By therapeutically 
effective amount is meant an amount effective ameliorate or prevent the symptoms, or 
prolong the survival of, the patient being treated. Determination of a therapeutically effective 
amount is well within the capabilities of those skilled in the art, especially in light of the 
detailed disclosure provided herein. 
10 For systemic administration, a therapeutically effective dose can be estimated 

initially from in vitro assays. For example, a dose can be formulated in animal models to 
achieve a circulating concentration range that includes the IC50 as determined in cell culture 
(i.e., the concentration of test compound that inhibits 50% of CLASP-2 binding interactions). 
Such information can be used to more accurately determine useful doses in humans. 
15 Initial dosages can also be estimated from in vivo data, e.g., animal models, 

using techniques that arc well known in the art. One having ordinary skill in the art could 
readily optimize administration to humans based on animal data. 

Dosage amount and interval can be adjusted individually to provide plasma 
levels of the proteins which are sufficient to maintain therapeutic effect. Usual patient 
20 dosages for administration by injection range from about 0.1 to 5 mg/kg/day, preferably from 
about 0.5 to 1 mg/kg/day. Therapeutically effective serum levels can be achieved by 
administering multiple doses each day. 

In cases of local administration or selective uptake, the effective local 
concentration of the proteins can not be related to plasma concentration. One having skill in 
25 the art will be able to optimize therapeutically effective local dosages without undue 
experimentation. 

The amount of CLASP-2 administered will, of course, be dependent on the 
subject being treated, on the subject's weight, the severity of the affliction, the manner of 
administration and the judgment of the prescribing physician. 
30 The therapy can be repeated intermittently while symptoms detectable or even 

when they are not detectable. The therapy can be provided alone or in combination with 
other drugs. In the case of autoimmune disorders, the drugs that can be used in combination 
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with CLASP-2 or fragments thereof include, but are not limited to, steroid and non-steroid 
immunosuppressive agents. 

5.10.3. Toxicity 

Preferably, a therapeutically effective dose of the proteins described herein 
5 will provide therapeutic benefit without causing substantial toxicity. 

Toxicity of the proteins described herein can be determined by standard 
pharmaceutical procedures in cell cultures or experimental animals, e.g., by determining the 
LD50 (the dose lethal to 50% of the population) or the LD100 (the dose lethal to 100% of the 
population). The dose ratio between toxic and therapeutic effect is the therapeutic index. 
1 0 The data obtained from these cell culture assays and animal studies can be used in 

formulating a dosage range that is not toxic for use in human. The dosage of the proteins 
described herein lies preferably within a range of circulating concentrations that include the 
effective dose with little or no toxicity. The dosage can vary within this range depending 
upon the dosage form employed and the route of administration utilized. The exact 
1 5 formulation, route of administration and dosage can be chosen by the individual physician in 
view of the patient's condition. (See, e.g., Fingl et ah, 1975, In: The Pharmacological Basis 
of Therapeutics , Ch.l, p.l). 

5.1 1 . Binding Assays 

CLASP-2 polypeptides can be used to screen for molecules that bind to 
20 CLASP-2 or for molecules to which CLASP-2 binds. The binding of CLASP-2 by the 
molecule can activate (agonist), increase, inhibit (antagonist), or decrease activity of the 
CLASP-2 or the molecule bound. Examples of such molecules include antibodies, 
oligonucleotides, proteins (e.g., receptors), or small molecules. Preferably, the molecule is 
closely related to the natural ligand of CLASP-2, e.g., a fragment of the ligand, or a natural 
25 substrate, a ligand, a structural or functional mimetic. (See, Coligan et aL, Current Protocols 
in Immunology 1(2): Chapter 5 (1991).) Similarly, the molecule can be closely-related to the 
natural receptor to which CLASP-2 binds, or at least, a fragment of the receptor capable of 
being bound by CLASP-2 (e.g., active site). In either case, the molecule can be rationally 
designed using known techniques. 
30 Preferably, the screening for these molecules involves producing appropriate 

cells which express CLASP-2, either as a secreted protein or on the cell membrane. Preferred 
cells include cells from mammals, yeast, Drosophila, or E. coli. Cells expressing CLASP-2 
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(or cell membrane containing the expressed polypeptide) are then preferably contacted with a 
test compound potentially containing the molecule to observe binding, stimulation, or 
inhibition of activity of either CLASP-2 or the molecule. 

The assay can simply test binding of a candidate compound to CLASP-2, 
where binding is detected by a label, or in an assay involving competition with a labeled 
competitor. Further, the assay can test whether the candidate compound results in a signal 

generated by binding to CLASP-2. 

Alternatively, the assay can be carried out using cell-free preparations, 
polypeptide affixed to a solid support, chemical libraries, or natural product mixtures. The 
assay can also simply comprise the steps of mixing a candidate compound with a solution 
containing CLASP-2, measuring CLASP-2 activity or binding, and comparing the CLASP-2 
activity or binding to a standard. Preferably, an ELISA assay can measure CLASP-2 level or 
activity in a sample (e.g., biological sample) using a monoclonal or polyclonal antibody. The 
antibody can measure CLASP-2 level or activity by either binding, directly or indirectly, to 
CLASP-2 or by competing with CLASP-2 for a substrate. 

In another aspect of the invention, the CLASP-2 polypeptides, or fragments 
thereof, can be used as ic bait proteins" in a two-hybrid assay (see, e.g., U.S. Pat. No. 
5,283,317; Zervos et al, 1993, Cell 72: 223-232; Madura et aL, 1993, J. Biol. Chem. 268: 
12046-12054; Bartel et al, 1993, Biotechniques 14: 920-924; Iwabuchi et aU 1993, 
Oncogene 8: 1693-1696; and Brent WO 94/10300), to identify other proteins, which bind to 
or interact with CLASP-2 ("CLASP-2-binding proteins" or "CLASP-2-bp") and modulate 
CLASP-2 polypeptide activity. Such CLASP-2-binding proteins are also likely to be involved 
in the propagation of signals by the CLASP-2 polypeptides as, for example, upstream or 
downstream elements of the CLASP-2 pathway. 

All of these above assays can be used as diagnostic or prognostic markers. The 
molecules discovered using these assays can be used to treat disease or to bring about a 
particular result in a patient by activating or inhibiting the CLASP-2 molecule. Moreover, the 
assays can discover agents which can inhibit or enhance the production of CLASP-2 from 
suitably manipulated cells or tissues. 

Therefore, the invention includes a method of identifying compounds or 
agents that bind to CLASP-2 polypeptides comprising the steps of: (a) contacting a CLASP-2 
polypeptide with a compound or agent under conditions which allow binding of the 
compound to the CLASP-2 polypeptide to form a complex and (b) determining if binding 
has occurred. Moreover, the invention includes a method of identifying agonists or 
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antagonists comprising the steps of: (a) incubating a candidate compound with CLASP-2, (b) 
assaying a biological activity, and (b) determining if a biological activity of CLASP-2 has 
been altered. 

Several methods of automating assays have been developed in recent years so 
5 as to permit screening of tens of thousands of compounds in a short period. See, e.g., Fodor 
et al. y 1991, Science 251 : 767-773, and other descriptions of chemical diversity libraries, 
which describe means for testing of binding affinity by a plurality of compounds. 

5.12. Other Uses of CLASP-2 Polynucleotides and Polypeptides 

The polynucleotides, polypeptides, polypeptide homologues, modulators, and 
10 antibodies described herein can be used in one or more of the following methods: a) drug 

screening assays; b) diagnostic assays particularly in disease identification, allelic screening 
and pharmocogenetic testing; and c) pharmacogenomics. A CLASP-2 polypeptide of the 
invention has one or more of the activities described herein and can thus be used to, for 
example, modulate an immune response in an immune cell, for example by binding to a 
15 CLASP-2 binding partner making it unavailable for binding to the naturally present CLASP-2 
polypeptide. 

In one embodiment, these CLASP-2 binding partners can be tyrosine kinases 
(e.g., lyn, Ick, fyn, ZAP-70m SyK, and CSK). In another embodiment, these CLASP-2 
binding partners can be tyrosine phosphatases {e.g., EZRIN, SHP-1, SHP-2 and PTP36). In 

20 another embodiment, these CLASP-2 target molecules can be adaptor proteins (e.g., NCK, 
CBL, SHC, LNK, SLP-76, HS1, SIT, VAV, GrB2, and BRDG1. In another embodiment, 
these CLASP-2 binding partners can be cytoskeletal associated proteins such as ankyrin, 
spectrin, talin, ezrin, tropomyosin, myosin, plectin, syndecans, paralemmin, Band 3 protein, 
cytoskeletal protein 4.1, and PTP36. In a further embodiment, CLASP-2 binding partners 

25 can be members of the integrin family.The isolated nucleic acid molecules of the invention 
can be used to express CLASP-2 polypeptide (e.g., via a recombinant expression vector in a 
host cell or in gene therapy applications), to detect CLASP-2 mRNA (e.g., in a biological 
sample) or a naturally occurring or recombinantly generated genetic mutation in an CLASP-2 
gene, and to modulate CLASP-2 activity, as described further below. In addition, the 

30 CLASP-2 polypeptides can be used to screen drugs or compounds which modulate CLASP-2 
polypeptide activity as well as to treat disorders characterized by insufficient production of 
CLASP-2 polypeptide or production of CLASP-2 polypeptide forms which have decreased 
activity compared to wild type CLASP-2. Moreover, the anti-CLASP-2 antibodies of the 
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invention can be used to detect and isolate an CLASP-2 polypeptide, particularly fragments 
of CLASP-2 present in a biological sample, and to modulate CLASP-2 polypeptide activity. 

5.13. Diagnostic Assays 

The invention further provides a method for detecting the presence of CLASP- 
5 2, or fragment thereof, in a biological sample. Usually the biological sample contains 

lymphocytes {e.g., from blood). The method involves contacting the biological sample with a 
compound or an agent capable of detecting CLASP-2 polypeptide or mRNA such that the 
presence of CLASP-2 is detected in the biological sample. 

A preferred agent for detecting CLASP-2 mRNA is a directly or indirectly 
10 labeled nucleic acid probe capable of hybridizing to CLASP-2 mRNA. The nucleic acid 
probe can be, for example, the full-length CLASP-2 cDNA of SEQ ID NO: 1 , or a portion 

7 

thereof, such as an oligonucleotide of at least 15, 30, 50, 100, 250 or 500 nucleotides in 
length and sufficient to specifically hybridize under stringent conditions to CLASP-2 mRNA. 

A preferred agent for detecting CLASP-2 polypeptide is a directly or 
15 indirectly labeled antibody capable of binding to a CLASP-2 polypeptide. Antibodies can be 
polyclonal, or more preferably, monoclonal. An intact antibody, or a fragment thereof (e.g., 
Fab or F(ab) 2 ) can be used. The term "directly or indirectly", with regard to the probe or 
antibody, is intended to encompass direct labeling of the probe or antibody by coupling (/.<?., 
physically linking) a detectable substance to the probe or antibody, as well as indirect 
20 labeling of the probe or antibody by reactivity with another reagent that is directly labeled. 
Examples of indirect labeling include detection of a primary antibody using a fluorescently 
labeled secondary antibody and end-labeling of a DNA probe with biotin such that it can be 
detected with fluorescently labeled streptavidin. The detection method of the invention can be 
used to detect CLASP-2 mRNA or polypeptide in a biological sample in vitro as well as in 
25 vivo. For example, in vitro techniques for detection of CLASP-2 mRNA include Northern 
hybridizations and in situ hybridizations. In vitro techniques for detection of CLASP-2 
polypeptide include enzyme linked immunosorbent assays (ELISAs), Western blots, 
immunoprecipitations and immunofluorescence. Alternatively, CLASP-2 polypeptide can be 
detected in vivo in a subject by introducing into the subject a labeled anti-CLASP-2 antibody. 
30 For example, the antibody can be labeled with a radioactive marker whose presence and 

location in a subject can be detected by standard imaging techniques. Particularly useful are 
methods which detect the allelic variant of CLASP-2 expressed in a subject and methods 
which detect fragments of an CLASP-2 polypeptide in a sample. 
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The invention also encompasses kits for detecting the presence of CLASP -2 in 
a biological sample: For example, the kit can comprise a directly or indirectly labeled 
compound or agent capable of detecting CLASP-2 polypeptide or mRNA in a biological 
sample; means for determining the amount of CLASP-2 in the sample; and means for 
5 comparing the amount of CLASP-2 in the sample with a standard. The compound or agent 
can be packaged in a suitable container. The kit can further comprise instructions for using 
the kit to detect CLASP-2 mRNA or polypeptide. 

The methods of the invention can also be used to detect naturally occurring 
genetic mutations in an CLASP-2 gene, thereby determining if a subject with the mutated 
10 gene is at risk for a disorder characterized by aberrant or abnormal CLASP-2 nucleic acid 
expression or CLASP-2 polypeptide activity as described herein. In preferred embodiments, 
the methods include detecting, in a sample of cells from the subject, the presence or absence 
of a genetic mutation characterized by at least one of an alteration affecting the integrity of a 
gene encoding an CLASP-2 polypeptide, or the misexpression of the CLASP-2 gene. 

15 5.14, Biological Activities of CLASP-2 

As described herein, CLASP-2 mediates a variety of cell functions in 
lymphocytes and other cells. As described herein, a variety of assays are useful for detecting 
or quantitating CLASP-2 activity, or for identifying agents (including polynucleotides, 
polypeptides, and antibodies of the invention) that modulate CLASP-2 activity (z.e., 

20 biological activity, e.g., binding) or expression. Such agents are useful for treatment of 

diseases and conditions associated with aberrant CLASP-2 expression or activity. Further, 
following the guidance provided herein, other CLASP-2-mediated activities can be identified 
by those of skill using routine assays, such as those described below. 

Exemplary assays for CLASP-2 function (or modulation of function) include 

25 assays for modulation of an in vitro or in vivo cell response {e.g., an immune response such 
as lymphocyte activation, antibody production, inflammation) by detecting a change in an 
activity (e.g., cytokine production, calcium flux, tyrosine phosphorylation, regulation of early 
activation markers, cell metabolism, proliferation, and the like, as described below) of cells in 
vitro or in vivo. In one embodiment, the cells are lymphocytes. 

30 In one assay, for example, recombinant CLASP-2 protein, peptides, or 

antibodies corresponding to the CLASP-2 extracellular domain can be mixed directly with T 
and B cells. Cytokine production by these cells can then be measured and the degree of 
modulation of the immune response quantitated. Alternatively, antigen-presenting B cells are 
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mixed with untransfected T cells or T cells that have been transfected with CLASP-2 
isoforms. Cytokine production (or calcium flux or other assays in §5.14.3) is be measured at 
the appropriate time to determine the effect of CLASP-2 on such an immune response. In a 
similar assay, B cells transfected with CLASP-2 constructs are tested for their ability to 
stimulate a T cell to generate an immune response. Transfected constructs in any of these 
cases could encode, for example, full or partial length CLASP-2 sequences, or antisense 
constructs to inhibit translation of endogenous CLASP-2 gene. Any of the examples 
described herein can be used to stimulate an immune response in the presence or absence of 
CLASP-2 isoforms or antibodies and assay the resulting effects on immune response by the 
methods listed in §5.14.3. 

* 

5.14.1 Methods for Generating an Immune Response in vitro 

In various assays, an effect of an agent on immune cells is detected using an in 
vitro assay. The degree of an immune response can be measured or quantitated by a number 
of standard assays including those described below. 

In one assay, human peripheral blood mononuclear cells (PBMC), human T 
cell clones (e.g., Jurkat E6, ATCC TIB- 152), EBV-transformed B cell clones (e.g., 9D10, 
ATCC CRL-8752), antigen-specific T cell clones or lines can be used to examine immune 
responses in vitro. Activation, enhanced activation or inhibition of activation of these cells or 
cell lines can be used for the evaluation of potential CLASP therapeutics. Standard methods 
by which hematopoietic cells are stimulated to undergo activation characteristic of an 
immune response are, for example: 

A) Antigen specific stimulation of immune responses. Either pre-immunized 
or naive mouse splenocytes can be generated by standard procedures. Li addition, antigen- 
specific T cell clones and hybridomas (e.g., MBP-specific), and numerous B cell lymphoma 
cell lines (e.g., CH27), have been previously characterized are available for the assays 
discussed below. Antigen specific splenocytes or B-cells can be mixed with specific T-cells 
in the presence of antigen to generate an immune response. This can be performed in the 
presence or absence of CLASP-2 to assay whether CLASP-2 modulates the immune response 
as measured by any of the assays in section 5.14.2. 

B) Non-specific T cell activation. The following methods can be used to 
activate T cells in the absence of antigen: 1) cross-linking T cell receptor (TCR) by addition 
of antibodies against receptor activation molecules (e.g., TCR, CD3, or CD2) together with 
antibodies against co-stimulator molecules, for example anti-CD28; 2) activating cell surface 
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receptors in a non-specific fashion using lectins such as concanavalin A (con A) and 
phytohemagglutinin (PHA); 3) mimicking cell surface receptor-mediated activation using 
pharmacological agents that activate protein kinase C (e.g., phorbol esters) and increase 
cytoplasmic Ca (e.g., ionomycin). 

C) Non-specific B cell activation: 1) application of antibodies against cell 
surface molecules such as IgM, CD20, or CD21. 2) Lipopolysaccharide (LPS), phorbol 
esters, calcium ionophores and ionomycin can also be used to by-pass receptor triggering. 

D) Mixed lymphocyte reaction (MLR). Mix donor PBMC with recipient 
PBMC to activate lymphocytes by presentation of mismatched tissue antigens, which occurs 
in all cases except identical twins. 

E) Generation of a specific T cell clone or line that recognizes a particular 
antigen. A standard approach is to generate tetanus toxin-specific T cells from a donor that 
has recently been boosted with tetanus toxin. Major histocompatability complex- (MHC-) 
matched antigen presenting cells and a source of tetanus toxin are used to maintain antigen 
specificity of the cell line or T cell clone (Lanzavecchia, A., et al y 1983, Eur. J. Immun. 13: 
733-738). 

The anticipated mechanism of action of a CLASP-2 polypeptide or 
polynucleotide should define the appropriate assay to use to investigate its potential 
enhancement or inhibition of lymphocyte activation. For example, soluble proteins 
containing the CLASP extracellular domain may interfere with the interaction between T 
cells and antigen presenting cells. Such interaction plays a role in the MLR and in antigen- 
specific T cell activation, but not in non-specific T or B cell activation. The assays described 
above have the advantage of several possible detection methods for quantitation. 

5.14.2. Methods for Generating an Immune Response in vivo 

In various assays, an effect of an agent on immune cells is detected using an in 
vivo assay. The degree of an immune response can be measured or quantitated by a number 
of standard assays including those described below. 

(A) Animal Model for Transplantation Rejection: Ectopic Heart 

Transplantation 

In one embodiment, a standard animal model for graft versus host rejection is 
ectopic heart transplantation (Fulmer et al 9 1963, Am. J. Anat. 113: 273-281). This method 
involves using BALB/C mice (either sex, and range from 1-9 months) for transplanting 
cardiac tissue into a surgically-created pocket on the dorsum for both ears made by slitting 
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the skin over the auricular artery at the base of the ear. Small curved forceps are forced into 
the slit, bluntly dissecting between the skin and the cartilage plate. Donor tissue is eased into 
the base of the pocket near the distal edge of the ear. The auricular artery is used to seal off 
the opening of the pocket. Within 10 to 14 days pulsatile activity of the transplant should be 
observed. Gross appearance of the graft, patterns of vacuolar supply to the graft area and 
pulsatile activity can be easily observed utilizing transilluminated light during the first three 
weeks post-transplantation. Follow-up can continue for for several months. 

(B) Animal model for Autoimmune Disease: Induction of Collagen 
Induced Arthritis (CIA) 

Collagen Induced Arthritis (CIA) is a standard model for studying progression 
and immune (Courtenay et al y 1980, Nature 283: 666 and Wooley et aL, 1981, J. Exp. Med. 
154: 688). DBA/a mice can be used as an assay for the in vivo relevance of CLASP-2 in 
vitro testing potential immune therapeutics. In vivo experiments will be performed to 
examine the ability of potential therapeutics to prevent CIA. We will use 3-5 mice per group 
to statistically justify our results. 

Once a titer of the potency of collagen type II (CII) is obtained therapeutics 
can be tested. In one embodiment, three mice will be immunized with three different 
concentrations of CII 50, 200, and 400 \ig per animal (Nabozny et al, 1996, J. Exp. Med., 
183: 27-37). To induce CIA, animals can be immunized with an appropriate concentration of 
CII, determined as described above. One half of a 1:1 ratio of antigen:CFA can be injected at 
the base of the tail and the remainder equally divided in each hind footpad. Mice can be 
carefully monitored every day for the onset and progression of CIA thoughout the experiment 
until its termination 12 weeks post-immunization with CII. The pieces of heart transplanted 
can be approximately 3X3 mm in size. The severity of arthritis can be assessed following 
standard procedures known to one of skill in the art. 

5.14.3 Assay Quantitation 

(A) Tyrosine phosphorylation 

Tyrosine phosphorylation of early response proteins such as HS1, PLC-r, 
ZAP-76, and Vav is an early biochemical event following T cell activation. The tyrosine 
phosphorylated proteins can be detected by Western blot using antibodies against 
phosphorylated tyrosine residues. Tyrosine phosphorylation of these early response proteins 
can be used as a standard assay for T cell activation (J. Biol. Chem., 1997, 272(23): 14562- 
14570). Any change in the phosphorylation pattern of these or related proteins when immune 
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responses are generated in the presence of CLASP-2 is indicative of a CLASP-2 modulation 
of this response. 

( B ) Intracellular Calcium Flux 

Tlic kinetics of intracellular Ca concentrations are measured over time after 
stimulation of cells preloaded with a calcium sensitive dye. Upon binding the Ca 2+ indicator 
dye, Fluor-4 (Molecular Probes), exhibits an increase in fluorescence level using flow 
cytometry, solution fluorometry, and confocal microscopy. Any change in the level or timing 
of calcium flux when immune responses are generated in the presence of CLASP-2 is 
indicative of a CLASP-2 modulation of this response 

( C ) Regulation of early activation markers 

Increased and diminished expression/regulation of early lymphocyte activation 
marker levels such as CD69, 1L-2R, MHC class II, B7, and TCR are commonly measured 
with fluoresccntly labeled antibodies using flow cytometry. All antibodies are commercially 
available. Any change in the expression levels of lymphocyte activation markers when 
immune responses arc generated in the presence of CLASP-2 is indicative of a CLASP-2 
modulation of this response. 

(D) Increased metabolic activity/acid release 

Activation of most known signal transduction pathways trigger increases in 
acidic metabolites. This reproducible biological event is measured as the rate of acid release 
using a microphysiometer (Molecular Devices), can be used as an early activation marker 
when comparing the treatment of cells with potential biological therapeutics (McConnell, 
KM. et al. 9 1992, Science 257: 1906-1912 and McConnell, H.M., 1995, Proc. Natl. Acad. 
Sci. 92: 2750-2754). Any statistically significant increase or decrease in acid release of 
CLASP-2-treated sample, as compared to control sample (no treatment), suggest and effect of 
CLASP-2 on biological function. 

(E) Cell proliferation/cell viability assays 
(1) 3 H-thimidine incorporation 

Exposure of lymphocytes to antigen or mitogen in vitro induces DNA 
synthesis and cellular proliferation. The measurement of mitotic activity by H-thimidine 
incorporation into newly synthesized DNA is one of the most frequently used assays to 
quantitative T cell activation. Depending on the cell population and form of stimulation used 
to activate the T cells, mitotic activity can be measured within 24-72 hrs. in vitro, post H- 
thimidine pulse (Mishell, B. B. and S. M. Shiigi, 1980, Selected Methods in Cellular 
Immunology, W. H. Freeman and Company and Dutton, R. W. and Pearce, J. D., 1962, 
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Nature 194: 93). Any statistically significant increase or decrease in CPM of CLASP-2- 
treated sample, as compared to control sample (no treatment), suggest and effect of CLASP-2 

on biological function. 

(2) MTS [5-(3-carboxymethoxyphenyl)-2^4,5-dimethylthiazolyl)-3(4- 

sulfophenyl)tetrazolium, inner salt] is a colorimetric method for determining the number of 
viable cells in proliferation or cytotoxicity assays (Barltrop, J.A. et al, 1991, Bioorg. & Med. 
Chem. Lett. 1:611). 1-5 days after lymphocyte activation, MTS tetrazolium compound, 
Owen's reagent, is bioreduced by cells into a colored formazan product that is soluble in 
tissue culture media. Color intensity is read at 490 nm minus 650 nm using a microplate 
reader. Any statistically significant increase or decrease in color intensity of CLASP-2- 
treated sample, as compared to control sample (no treatment), can suggest an effect of 
CLASP-2 on biological function (Mosmann, T., 1983, J. Immunol. Methods 65: 55 and 

Barltrop, J.A. et al (1991)). 

(3) Bromodeoxyuridine (BrdU), a thymidine analogue, readily 
incorporates into cells undergoing DNA synthesis. BrdU-pulsed cells are labeled with an 
enzyme-conjugated anti-BrdU antibody (Gratzner, H.G., 1982, Science 218: 474-475.). A 
colorimetric, soluble substrate is used to visualize proliferating cells that have incorporated 
BrdU. Reaction is stopped with sulfuric acid and plate is read at 450 nm using a microplate 
reader. Any statistically significant increase or decrease in color intensity of CLASP-2- 
treated sample, as compared to control sample (no treatment), suggest an effect of CLASP-2 
on biological function. 

(F) Apoptosis by Annexin V 

Programmed cell death or apoptosis is an early event in a cascade of catabolic 
reactions leading to cell death. A lose in the integrity of the cell membrane allows for the 
binding of fluorescently conjugated phosphatidylserine. Stained cells can be measured by 
fluorescence microscopy and flow cytometry (Vermes, L, 1995, J. Immunol. Methods. 180: 
39-52). In one embodiment, any statistically significant increase or decrease in apoptotic cell 
number of CLASP-2-treated sample, as compared to control sample (no treatment), suggest 
an effect of CLASP-2 on biological function. For evaluating apoptosis in situ, assays for 
evaluating cell death in tissue samples can also be used in vivo studies. 

(G) Quantitation of cytokine production 

Cell supernatants harvested after cell stimulation for 16-48 hrs are stored at - 
80°C until assayed or directly tested for cytokine production. Multiple cytokine assays can 
be performed on each sample. IL-2, IL-3, IFN-y and other cytokine ELISA Assays are 
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available for mouse, rat, and human (Endogen, Inc. and BioSource). Cytokine production is 
measured using a standard two-antibody sandwich ELISA protocol as described by the 
manufacturer. The presence of horseradish peroxidase is detected with 3, 3'5, 5' tertamethyl 
benziidine (TMB) substrate and the reaction is stopped with sulfuric acid. The absorbency at 
5 450 nm is measured using a microplate reader. Any statistically significant increase or 
decrease in color intensity of CLASP-2-treated sample, as compared to control sample (no 
treatment), suggest an effect of CLASP-2 on biological function. 

(H) NF-AT can be visualized by Immunostaining 
T cell activation requires the import of nuclear factor of activated T cells 
10 (NFAT) to the nucleus. This translocation of NF-AT can be visualized by immunostaining 
with anti-NF-AT antibody (Cell 1998, 93: 851-861). Therefore, NF-AT nuclear 
translocation has been used to assay T cell activation. Similarly, NF-AT/luei-ferase reporter 
assays have been used as a standard measurement of T cell activation (MCB 1996, 12: 7151- 
7160). 

15 (I) ELISA for collagen type II (CII)-specific antibodies (see above for 

related in vivo assay) 

C(II) titers from serum of animals immunized with CLASP-2 can be measured 
and compared. Both TH1 -dependent IgG2a and TH2 -dependent IgGl and IgE CII- specific 
antibody isotypes will be measured by ELISA. Mouse blood will be obtained by orbital 

20 bleed one and two months post-immunization with CII. Samples will be allowed to coagulate 
and centrifuge to obtain sera, and stored at -80°C until assayed by ELISA. Coat ELISA 
plates with CII and dilute sera. HRP conjugated goat, isotype specific antibody. Plates are 
then expose to TMB substrate and read at 450 nm using a microplate reader (Nabozny et al., 
1996, J. Exp. Med. 183: 27-37). Any change in the levels of Collagen specific antibodies by 

25 colorimetric test when immune responses are generated in the presence of CLASP-2 is 
indicative of a CLASP-2 modulation of this response. 

(J) Antibody Production by ELISPOT Assay 
A solid-phase enzyme-linked immunospot (ELISPOT) assay for the 
quantification of isotype-specific antibody secreting cells (Czerkinsky et aL 9 1983, J 

30 Immunol. Methods. 65: 109-121). Both human and mouse B cells can be tested for isotype 
and antigen specific antibody production. Although based on a standard ELISA, this 
technique becomes more sensitive by detecting antibody secretion from single cells. Any 
change in ELISPOT levels when immune responses are generated in the presence of CLASP- 
2 is indicative of a CLASP-2 modulation of this response. 

107 


3NSOOCID: <WO 0231 1 1 7A2_!_> 


1 


WO 02/31117 PCT/US01/32202 

(K) Cellular degranulation following IgE cross-linking. 

Two cell lines have been obtained from ATCC (MEG01 and HEL-17.92), 
both of which express the human FCeRl receptor. FCeRl is the high affinity receptor for 
IgE complexes, which when coupled to biotin can be cross-linked with avidin to induce 
5 degranulation and histamine release of lymphocytes. Following acylatation of the sample, 
histamine is quantified with an enzyme immunoassay competition assay (Immunotech). 
Histamine release. A statistically significant increase or decrease in histamine concentration 
of a CLASP-2 treated sample, as compared to control sample (no treatment), suggest an 
effect of CLASP-2 on biological function. Any change in frequency of degranulation or 
10 histamine levels when immune responses are generated in the presence of CLASP-2 is 
indicative of a CLASP-2 modulation of this response. 

( L ) Ccl lular phenotyping of lymphocytes by flow cytometry and 

Immunocytochcmistry 

Determining the tissue distribution of lymphocytes following a pathological 

15 disorder can aid in identifying specific organ, tissue and lymphocyte involved in an immune 
response. Cellular phenotyping of lymphocyte trafficking is generally performed with by 
flow cytometry and lmmunocytochemistry. There are several cluster determination (CD) 
molecules that are routinely used to identify phenotype, activation kinetics, and regulation 
events of cells. Any change in levels or distribution of CD molecules when immune 

20 responses are generated in the presence of CLASP-2 is indicative of a CLASP-2 modulation 
of this response. 

(M) Structure/Function Assays: Homotypic and/or Heterotypic, Calcium- 
dependant Cell Adhesion 

L929 cells can be transfected with CLASP-2 and Neomycin. G418-resistant 

25 clones can be screened for CLASP-expression with anti-CLASP peptide-specific antibodies. 

These CLASP-expressing clones can then be used to test for homotypic and/or heterotypic 

calcium dependent cell adhesion using the "cell aggregation assay" described for cadherin 

molecules (Murphy-Erdosh, C. et aL 9 1995, J. Cell Biol. 129: 1379-1390). Any change in the 

levels of cellular aggregation when immune responses are generated in the presence of 

30 CLASP-2 is indicative of a CLASP-2 modulation of this response. 

*** 

The following cDNA clones described in the Specification and further 
described in the Examples below have been deposited with the American Type Culture 
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Collection, 10801 University Boulevard, Manassas, VA 201 10-2209 under the Budapest 

Treaty on March 24, 2000 and given the Accession Nos. indicated: 

hCLASP-2 A 3 5 clone (AVC-PD 1 ) ATCC accession number PTA- 1563 
hCLASP-2 A 5 y clone (AVC-PD2) ATCC accession number PTA- 1 562 
5 hCLASP-2B clone (AVC-PD 12) ATCC accession number PTA-1 573 

The following cDNA clones described in the Specification and further 

described in the Examples below have been deposited with the American Type Culture 

Collection, 10801 University Boulevard, Manassas, VA 20110-2209 under the Budapest 

Treaty on and given the Accession Nos. indicated: 

10 hCLASP-2 clone hC2GR3.3 (AVC-PD 14) ATCC Accession No. 

hCLASP-2 clone hC2RT (AVC-PD 19) ATCC Accession No. 

*** 
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6. EXAMPLES 

EXAMPLE 1 

Cloning ofCLASP-2 
The cloning of the CLASP gene family has not been a straghtforward process. 
The cloning of each CLASP family member required the use of multiple techniques and 
resources. CLASP-2 was cloned in the following manner: an expressed sequence tag or EST 
clone (IMAGE clone 815795, derived from human germinal B cells) was identified based on 
a BLAST search of human GenBank human EST database using CLASP- 1 sequences. 
IMAGE clone 815795 was sequenced completely. A polynucleotide probe prepared from 
815975 sequence was labeled with 32 P-dCTP and used to screen human cDNA libraries 
including Jurkat (Stratagene) and Ramos B cell cDNA library (James Boulter, UCLA). The 
screening methods employed were as described in Maniatis et aL 9 1989, Molecular Cloning A 
Laboratory Manual, Cold Spring Harbor Laboratory, New York. Several clones were 
identified and clone C9> with an insert of 3,752 base pairs, was sequenced (ABI dye- 
sequencing system, PE Applied Biosystems; Perkin-Elmer Corporation, 761 Main Avenue, 
Norwalk, CT, U.S.A.). A 5' probe was prepared from C9 sequence and used to rescreen the 
cDNA libraries. Several clones were isolated, but could not be excised from the phage 
(Stratiagene, CA) without deleting the insert. To circumvent this problem, anchor PCR was 
performed using M13F primer and CLASP-2 primer (C96AS). The PCR fragment was 
cloned using the pGEM-T system (Promega), although initial attempts were unsuccessful. 
The isolated sequence encompassed additional but incomplete cDNA sequence and was 
determined to carry at least one mutation that may have allowed it to be propagated in 
bacteria. Commercial libraries from multiple tissue sources including human placenta, B cell, 
T cell and peripheral blood were exhaustively screened and re-screened resulting in the 
acquisition of only partial cDNAs. Generation of cDNA libraries using oligo dT or CLASP- 
specific primers also resulted in the acquisition of partial cDNAs. Genomic libraries were 
screened to obtain a portion of the genomic locus for each of the CLASP genes, and a 
genomic walk was initiated to obtain 5' exons and extend the cDNA sequence. 

To obtain additional 5' CLASP-2 sequence, portions of the cDNA and 
genomic sequence from a BAC (Bacterial Artificial Chromosome) genomic library were 
compared to the NCBI database by BLAST. A genomic clone (Genbank identifier: 
gi9988160) comprising random, shotgun genomic sequence was identified. Using TFASTX 
(Pearson and Lipman, PNAS (1988) 85:2444-2448), the amino-terminal sequence of human 
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CLASP4 was compared to 6 frame translation of gi9988160. Areas of gi9988160 that 
encoded amino acids with high similarity to CLASP4 amino acid sequence were used to 
design CLASP-2-specific oligonucleotides for RTPCR (reverse transcriptase polymerase 
chain reaction according to manufacturers instructions: Reverse transcriptase Gibco/BRL, 

5 Taq Polymerase from Sigma). Using oligonucleotides hC2gS5 (nucleotides -66 to —44 of 
FIG. 11) and C2AS18 (reverse complement of nucleotides 2120 to 2140 of FIG. 11) an 
RTPCR product of approximately 2.2kb was generated, sequenced (dideoxynucleotide 
termination sequencing, Beckman Coulter CEQ2000) and shown to be additional human 
CLASP-2 5' sequence. Further complicating the cloning fiill-length CLASP cDNA products 

10 was the difficulty to clone (and subclone) certain CLASP cDNA products. Standard isolation 
of some of the CLASP cDNAs from a pure phage population following screening of 
commercially available cDNA libraries ("ZAP-out" procedure, Stratagene) resulted in no 
bacterial colonies. Similarly, certain RT-PCR products could not be cloned into standard 
plasmid vectors. No colonies were isolated by cloning these fragments into vectors lacking 

15 promoters, reverse orientations, low copy vectors, or by growth at altered temperatures or 
levels of antibiotic for plasmid selection (examples: CLASP-7 - HC7gS6 to HC7gASl and 
HC7gS3 to HC7AS14; CLASP-4 - C4P2 to hC4ASTM and C4P2 to HC4AS3'; CLASP-1 - 
hClS5' to hClAS3'Kpn and C1S7 to hClAS3'Kpn; see Primer Table below). One 
possibility is that sequences contained within certain regions of CLASP cDNAs are 

20 bacteriacidal and therefore not amenable to cloning. To circumvent these problems direct 
sequencing of RT-PCR products was performed. 
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Primer Table 


CLASP 
gene 

Sense 
Primer 

Sense sequence 

Antisense 
Primer 

Antisense sequence 

CLASP-7 

HC7gS5 

AGGCCTTGTCTCTGTTTACCTG 

HC7gASl 

TGTCATGTACTGCACTCGCACAGC 

CLASP-7 

HC7gS3 

ACAGGAACCTGCTGTACGTGTAC 

HC7ASH 

TCGTGGCTGCACAGGATGCGGGTG 

CLASP-4 

C4P2 

GACCCATTAGGAGGTCTAC 

HC4AS3* 

CGGGATCCATTGTCACCGTACATCT 
GC 

CLASP-4 

C4P2 

GACCCATTAGGAGGTCTAC 

HC4AS3* 

CGGGATCCATTGTCACCGTACATCT 
GC 

CLASP- 1 

hOS5* 

TATGTCTCAGTCACCTACCTG 

HClAS3*Kpn 

CTTGGTACCACTTCAGCACTAGATG 
AGATG 

CLASP-1 

C1S7 

TCAAGACCAGGGCATGCAAG 

HClAS3'Kpn 

CTTGGTACCACTTCAGCACTAGATG 
AGATG 


In-frame stop codons were not present suggesting that the cDNA was not full 
length. To obtain the 5' terminus of CLASP-2 5* RACE was employed. Antisense 
oligonucleotides directed against the 5' end of the longest CLASP-2 sequence were 
generated: 

Primers used for human CLASP-2 5 5 RACE 

primer sequence rs' TO 3') nucleotide position 

HC2RACE1 

AAG AGC AGC AT C TC C CGTAAAC AGTC -15 to 11 

HC2RACE2 

TAACAAGCTCTGTGCTTCCTCTTCCG 414 to 443 

• HC2RACE3 

ACCACTTTGTTCGGAAGCTGTCGAAACTC 512 to 540 

HC2RACE4 

TTTGTACAGCCAGCCATGCTTGGTGATC 634 to 661 

RACE was carried out using Generacer kit (Invitrogen) according to 
manufacturers specifications using polyA selected mRNA from 9D10 B cell tissue culture 
line. The sequence of the oligonucleotides presented is the reverse complement (i.e., 
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antisense) of the the CLASP 1 cDNA at the indicated position based upon numbering in FIG. 
11. 

The full length cDNA (presented in FIG. 1 1 ) is therefore a compilation of 
cDNA from cDNA libraries, RTPCR products and 5' RACE products. The sequence of the 
5 CLASP-2 cDNA is shown in FIG. 1 1 . 

EXAMPLE 2 

Tissue and Cell Line Expression of the CLASP-2 gene 
Multiple Tissue Northern blots were purchased from Clontech; hybridization 
procedures were followed according to manufacturer's procedures and recommendations. 

10 Human T cell line (Jurkat), human myelomonocyte cells (MV4-1 1), B cells (9D10), 

monocytes (THP-1), mouse T cells (3A9), mouse B cells (CH27). human promyelocyte 
(HL60) and human kidney epithelial cells (293 cell line) were maintained as cultured cell 
lines. For Multiple Cell Northerns, RNA was prepared from cell suspensions using the 
GIBCO-BRL Trizol system. All steps were performed according to the manufacturer's 

15 procedures and recommendations. RNA concentrations were determined by the 

260nm/280nm light absorption of the RNA solution. 20 fig RNA was ethanol precipitated and 
resuspended in formamide/formaldehyde buffer and incubated for 15* at 65°C to eliminate 
putative secondary structures. RNA samples were run over night on a 1.1% agarose gel 
containing 1.5% formaldehyde (both gel and running buffer were 20 raM sodium phosphate, 

20 pH 7.5), To visualize RNA after gel migration, approx. 0.5 \ig ethidium bromide was added 
to each sample prior to the run together with RNA loading buffer. RNA in the gel was then 
visualized by 260nm wavelength light. After soaking the gel for 15' in deionized water to 
reduce the concentration of ethidium bromide in the gel, the RNA was transferred onto 
Amersham Hybond-N plus membrane by capillary blotting in 20 x SSC buffer for 5 hours. 

25 Subsequent to blotting, the membrane was washed in 5 x SSC for 3' and RNA was 
crosslinked to the membrane by UV light (Stratagene Stratalinker). 

A probe which recognizes CLASP-2 isoforms A, B, C, and D (probe HC2.2) 
was used. Probe HC2.2 encompasses to nucleotides 3920 to 4650 (731 bp long) of CLASP- 
2A. The HC2.2 probe was prepared using standard labeling kits and desalted using pasteur 

30 pipette G-50 Sephadex column in TEN (10 mM Tris-HCl, pH 8.0, 1 mM EDTA, and 100 
raM NaCl). 

Hybridizations of P dCTP labeled DNA probes to the membrane bound 
RNAs (multiple tissue and multiple cells) were carried out in CLONTECH EXPRESSHYB 
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solution, at 68°C and for 1-2 hours. Blots were washed 2 times in 2x SSC 0.1% SDS for 10' 
each at 50°C and then twice in 0.2 x SSC 0.1% SDS for 10' each at 50°C, followed by a 5' 
wash in 2xSSC at 50°C. Exposure to KODAK BIOMAX MS film was carried out at minus 
80°C using amplifying screens. Typical exposure times were 10 to 36 hours. 

5 EXAMPLE 3 

Southern Analysis of CLASP-2 
BAC DNA was prepared from E. coli over night cultures using the QIAGEN 
DNA preparation system. All preps were performed according to the manufacturer's 
procedures, including the modifications for low copy number DNA constructs. Genomic 

10 DNA was prepared from HeLa cells (ATCC #CCL-17) using the methods described by 

Sambrook, Fritsch and Maniatis (1989); DNA concentrations were determined by the 260nm 
light absorption of the DNA solution, and aliquots corresponding to 20 microgram (\ig) 
genomic DNA or 2 \ig for BAC DNA were used for restriction en2yme digests with Eco RI 
or HinD HI (genomic DNA) or Eco RI and Pst I (BAC DNA). Digests were carried out in 

15 150 microliter volume for 4 hours at 37°C. Digested DNA was ethanol precipitated and the 
pellet was resuspended in 20 microliter deionized water prior to migration over a 1 .2 % 
agarose gel at 35 V over night. Running buffer was TAE, and the gel contained 0.1 (ig 
ethidium bromide/ml to visualize DNA. 

Subsequent to gel separation, DNA was visualized by 260 nm wavelength 

20 light. The gel was then washed twice for 20' in denaturing buffer (0.5M NaCl, 0.4 N NaOH) 
and twice in neutralization buffer (1.5 M NaCl, 0.5 M TRIS pH 8.0). DNA was transferred 
from the gel onto AMERSHAM HYBOND N membrane by capillary blotting in 20 x SSC 
for 5 hours. The DNA was crosslinked to the membrane by UV light using a Stratagene 
Stratalinker. 

25 A probe, HC2.1, which recognizes CLASP-2, was used. Probe HC2.1 

encompasses nucleotides 325 to 1 126 (802 bp long) of CLASP-2A. The HC2.1 probe was 
prepared using standard labeling kits and desalted using pasteur pipette G-50 Sephadex 
column in TEN (10 mM Tris-HCl, pH 8.0, 1 mM EDTA, and 100 mM Nacl). Hybridizations 
of 32 P dCTP labeled DNA against DNA immobilized onto the membrane were carried out at 

30 65°C overnight in modified CHURCH hybridization solution (7% SDS, 0.5 M 

sodiumphosphate, ImM EDTA). Membranes were then exposed to KODAK BIOMAX MS 
film at minus 80°C. Typical exposure times were 12 hours for genomic DNA southern 
analysis and 3 hours for BAC DNA Southern analysis. 
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The genomic DNA southern analysis revealed two fragments (-4.5 kb and 
1 .85 kb) in the Eco RI digested DNA but three fragments in BACs 4 and 6 DNA. The two 
major bands axe identical in both genomic and BAC DNA (FIG. 7). 

EXAMPLE 4 

5 CLASP-2 Genomic Cloning 

Genomic clones of human CLASP-2 were obtained using the Release I high 
density filters from Genome Systems Inc (cat # FBAC-4434). Two rounds of screening were 
completed. The first round of screening was carried out using a probe corresponding to 
nucleotides 3830 to 4558 of the human CLASP-2 cDNA by standard protocols specific by 

10 Genome Systems. This screen identified two genomic clones, referred to as AVC BAC4 and 
7. A second round of screening using a probe that corresponded to nucleotides 1208 to 1604 
of human CLASP-2 cDNA identified clone AVC BAC26. All the clones were partially 
sequenced to authenticate that they were indeed CLASP-2 genomic clones, to verify exon 
sequences, and to identify exon/intron boundaries. Oligonucleotides for sequencing the 

15 BACs were based upon human CLASP-2 cDNA sequence. Sense and antisense sequencing 
oligonucleotides were designed along the length of the human CLASP-2 cDNA spaced 
approximately every 200 nucleotides to ensure a high density of coverage of the 
corresponding genomic regions. Sequencing reactions with primers and BAC DNA were 
carried out by standard PGR sequencing using Big Dye termination sequencing mix (ABI). 

20 Results from sequence reactions were analyzed using Sequencher software (Genecodes). The 
results are summarized in FIG. 6. 

EXAMPLE 5 

Expression of Recombinant CLASP-2A Polypeptide in Bacterial Cells 
Portions of hCLASP-2 were cloned into the GST expression vector pGEX 

25 (Pharmacia). These include the region spanning the potential Cadherin processing site 

through 200 amino acids of the predicted extracellular domain (nucleotide 866 - 1459; GST- 
EC 12; 55 kD fusion) and a portion of the intracellular domain (nucleotide 3230 - 4065; GST- 
cyto; 57 kD fusion). These regions were amplified using primers at the limits of these 
sequences on either cDNA clones or cDNA generated from Jurkat or Human Peripheral 

30 Blood RNA. Amplified DNA sequences were digested with restriction enzymes for cloning 
in-frame into GST expression vectors. Fusion proteins were expressed by IPTG induction in 
DH5a and purified according to instructions from Pharmacia using glutathione-Sepharose 
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(Pharmacia). SDS-PAGE gel stained with Coomassie Blue showing induced and uninduced 
expression of the GST-CLASP-2-cyto construct is shown in FIG. 8. These recombinant 
proteins were expressed in DH5a and purified according to instructions from Pharmacia 
using glutathione-Sepharose. Such recombinant proteins were used to generate antibodies 
(Josman laboratories) using a AVC Rapid Immunization Protocol. 

The full length CLASP can easily be expressed from either the beginning of 
the hCLASP-2 sequence (in frame with nucleotide 2) or from the first or second methionine 
(nucleotide 278 or nucleotide 476, underlined in FIG. 1) through to the stop codon 
(nucleotide 4058). Assuming that the GST moiety has a weight of 26 kD, the total predicted 
sizes are 180, 168, and 164.5 kD respectively. Alternatively, other bacterial expression 
systems such as 6CLASP HIS tags, Calmodulin binding protein, maltose binding protein can 
also be used in a similar manner. 

EXAMPLE 6 

Expression of Recombinant CLASP-2A Polypeptide in Mammalian Cells 
Example 6A . Secreted fusions 

Several portions of the predicted extracellular domain were constructed as 
hlgG fusions using the CD5gamma-l expression vector (kindly provided by B. Seed, Harvard 
University). Polypeptides were cloned into this vector in frame with a CDS leader sequence 
that directs the fusion protein into the secretory pathway and in frame with a C-terminal 
hIgG(Fc) protein. This fusion can be secreted from cell lines such as 293 (Hsieh, J-Q, 1999, 
Nature 398: 431-436). Sense primers with hCLASP-2 sequences beginning at nucleotide 866 
and antisense primers at nucleotide 1459 (EC12-IgG), nucleotide 2389 (ECC-IgG) and 
nucleotide 2857 (ECM-IgG) were used to amplify portions of the extracellular domain for 
insertion into this vector. Recombinant vectors were purified by Maxiprep (Qiagen) and 
transfected into 293 EBNA- T cells (kindly provided by B. Seed, Harvard University) by 
calcium phosphate techniques (Sambrook and Maniatis). After 2-7 days, secreted expression 
was analyzed by an ELISA against the hlgG fusion using a goat F(ab')2 anti human IgG(Fc) 
antibody (Jackson Immunolabs) and Protein- A-HRP (Pierce). Intracellular expression was 
monitored by immunofluorescence microscopy with a FITC labeled goat anti Human IgG(Fc) 
antibody (Caltag). 
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Example 6B . Intracellular fusions 

Similar methods have been used to construct fusions for expression of full 
length hCLASP-2 isoforms as well as truncated C-terminal forms in other cell lines such as 
Jurkat. Recombinant hCLASP-2 fragments were either isolated by digestion of cDNA clones 
5 or amplified by primers flanking specific regions (Please provide some specific regions). 
These can be cloned into expression vectors such as pBJl-neo (Mark Davis, Stanford 
University). Peak 12 (B. Seed, Harvard University), and pDsRedl-Nl (Clontech). pBJl-neo 
and Peak 1 2 allow untagged expression of recombinant proteins and pDsRedl-Nl will allow 
either untagged or a C- terminal Red fluorescent protein tag. These can be used to generate 
10 protein or for expression of various forms for functional analyses. 

EXAMPLE 7 

Antisense Inhibition of CLASP-2 Expression 
Fxample 7A . Inhibition of CLASP-2 expression in vitro 
In this example, inhibition of CLASP-2 expression is examined using an in 

15 vitro cell-free expression system. To identify the useful antisense oligonucleotides, a series 
of antisense phosphorothioate oligonucleotides (PS-ODNs), which span portion CLASP-2 
sequence, can be systematically assayed for the ability to block CLASP-2 expression in vitro. 

For inhibition of CLASP-2 expression in vitro, a CLASP-2 
transcription/expression plasmid can be used according to standard methodology for in vitro 

20 transcription and translation of sense CLASP-2 RNA. Coupled transcription-translation 

reactions can be performed with a reticulocyte lysate system (Promega TNTTM) according to 
standard conditions. Each coupled transcription/translation reaction can include CLASP-2 
RNA transcribed from the expression plasmid, and a test antisense polynucleotide at a range 
of standard test concentrations, as well as the luciferase transcription/translation internal 

25 control to normalize each reaction (see, e.g., Sambrook et al 9 supra, Ausubel et al 9 supra). 
The translation reaction can also be performed with sense CLASP-2 RNA that is synthesized 
in vitro in a separate reaction and then added to the translation reaction. 35 S-Met is included 
in the reaction to label the translation products. The negative control is performed without 
added PS-ODN or a sense PS-ODN. 

30 The labeled translation products can be separated by gel electrophoresis and 

quantitated after exposing the gel to a phosphorimager screen. The amount of CLASP-2 
protein expressed in the presence of CLASP-2 specific PS-ODNs can be normalized to the 
co-expressed luciferase control. 
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Example 7B . Inhibition of CLASP-2 expression ex vivo 

A. Reagents ^ 

Cells: Jurkat, Clone E6-1 ATCC TIB-152; 9D10 ATCC CRL8752; additional 

cells from the ATCC or NCI. 

Media and solutions: RPMI 1640 medium, BioWhitaker; DMEM/M199 
medium, BioWhitaker; EMEM, BioWhitaker; Fetal Bovine Serum, Summit (stored frozen at 
-20°C, stored thawed at 4°C); Trypsin-EDTA, GD3CO (catalogue #25300-054) (stored frozen 
at -20°C, stored thawed 4°C; Isoton II (stored at RT); DMSO (stored at RT); oligonucleotides 
(see Table 1 and FIG. 3, stored in solution at -20°C); PBS (Ca 2+ /Mg 2+ free); TE; 10 mM Tris- 
HCL, pH 8.0; 1 mM EDTA. 

To prepare oligonucleotide stocks: Oligonucleotide nucleotides (PS-ODNs) 
can dissolved in the appropriate amount of TE to make a concentrated stock solution (1-20 
mM). 

B. Treatment of cells ex vivo with antisense CLASP-2 oligonucleotides 
Stock cultures of cells in log-phase growth (in T75 flask) can be used. Jurkat, 

and 9D10 cells are used in this assay. Jurkat and 9D10 are suspension cultures and are 
passed through dilutions in media. Cell density is measured using a Coulter counter or 
hemacytometer. 

For 6-well dishes, 1.1 x 10 5 cells total per well, 2 ml/well is added. The 
amount of cells can be scaled up or down proportionally for 12- well, 100 mm, or 150 mm 
dishes. For example, for 12-well dishes, use 4.6 x 10 4 cells in 2 ml media; for 100 mm dishes 
use 6 x 10 5 cells in 10 ml media; for 150 mm dishes use 1.7 x 10 6 cells in 35 ml media. 

An appropriate number of cells (as described in step 2 above) are collected, 
centrifuged and resuspended in media containing a range of ODN concentrations. The cells 
are treated in single, duplicate, or triplicate wells. Control wells are treated with TE or sense 
ODNs diluted in media. 

The suspension cultures are washed and resuspended daily with PS-ODN 

media. 

Suspension cultures are grown for 2-4 days. Cells are washed with PBS and 
density measured using a Coulter counter or a hemocytometer. If necessary, the cells are 
replated at 1.1 x 10 5 cells per well, 2 ml media per well, and fed with PS-ODN as described 
above. 
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Samples of the cells can also be harvested for analysis to determine the effects 
of CLASP-2 antisense ODNs. Samples are harvested for RNA and analyzed by either 
Northern analysis or RT-PCR for the presence of CLASP-2 mRNA. Functional 
consequences of CLASP-2 antisense ODNs can be analyzed by measuring the ability of 
Jurkat and 9D10 cells to be activated. Jurkat cells are activated by exposure to anti-CD3 and 
anti-CD28 crosslinking antibodies, and 9D10 cells are activated by exposure to anti-IgM 
crosslinking antibody or P. aeruginosa lipopolysaccharide. A hallmark of activation, calcium 
influx, can be measured by flow cytometry. Additionally, ELISA assays can be used to 
measure Interleukin-2 production from Jurkat cells and secreted IgM can be measured using 
standard assays from 9D10. 

Table 5 below shows exerrtpiafy oligonucleotides for this assay: 


Table 5 


Oligo 

Sequence 5*- 3' 

length 

notes/ comments 

1 

GAAGGCGATCATCACGT 
GGCCTTCCATCGC 

30-mer 

encompasses nucleotides 473-502 and spans the 
putative initiator methionine (underlined). The 
function of HC2A, 2B, 2C, and 2E isoforms can be 
eliminated by this oligonucleotide. 

2 

GCTTCAAGTAATGACTGG 
TGCAGAACATCTG 

3 1 -mer 

Oligonucleotide that should recognize HC2A, 2B, 
2D, 2E, and 2F. Encompasses nucleotides 2121- 
2151. Can be eliminate function of these CLASP-2 
isoforms. 

3 

GCTCCTCCTCAGGCAGGC 
GCTATGGCTGTGG 

34-mer 

oligonucleotide specific for HC2C based upon a 
specific exon found at nucleotide 2927. Can 
eliminate only HC2D function. 

4 

GTAGGCCCGGTGCAGCGT 
GTCATACAGATGG 

3 1 -mer 

oligonucleotide specific for HC2B, 2C, 2D and 2E 
based upon specific exon sequence found at 
nucleotide 3153. Can eliminate function of these 
CLASP-2 isoforms. 

5 

GCAATGTCTGAGACTTTC 
GATCATGAACTATG 

32-mer 

oligonucleotide specific for HC2A, 2B, 2E, and 2F. 
Encompasses nucleotides 1987-2018. Can 
eliminate function of these CLASP-2 isoforms. , 

6 

CAGGAGCTGGTTCTTAAA 

1 8 -mer 

oligonucleotide specific for HC2A, 2D and 2E. 
Encompasses nucleotides 2219-2224. Can 
eliminate function of these CLASP-2 isoforms 


Table 5 legend. All nucleotide numeration are relative to Human CLASP-2A (HC2A). See FIG. 2A. 


EXAMPLE 8 

Example 8A . Synthesis of carboxvl-termini PDZ-ligand peptides 

The GST-PDZ fusion proteins are made following standard procedures. An 
exemplary GST-PDZ fusion protein was constructed as follows: A 572 bp fragment encoding 
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two PDZ domains of the human neDLG gene (Genbank Accession No. U49089.1) was 
amplified from total Jurkat RNA by RT-PCR according to standard protocols (Sambrook, 
Fritsch, and Maniatis, 1989, Molecular Cloning - A Laboratory Manual. Cold Spring Harbor 
Press.) using primers flanked by restriction endonuclease sites for cloning. Fragments were 
purified by Sephaglas (Pharmacia), digested with the appropriate enzymes, and ligated into 
the GST expression vector pGEX-3X (Pharmacia) cut with similar enzymes. Recombinant 
constructs were confirmed by sequencing. Fusion proteins were expressed by IPTG 
induction in DH5a and purified using glutathione-Sepharose (Pharmacia) according to 
instructions from Pharmacia. Excess glutathione was removed using a PD10 desalting 
column (Pharmacia) and samples were diaconcentrated by placing the protein in dialysis 
tubing (14,000 M W cutoff) and laying the tubing on polyethylene glycol (3350; Sigma) until 
volume had been reduced by approximately 50%. Glycerol was then added to 35% final 
concentration and samples were stored at— 20°C. These recombinant proteins have been used 
to generate antibodies (Josman laboratories) by standard protocols and for biochemical 
studies describe herein. 

Synthetic peptides corresponding to the carboxyl-terminus of a protein of 
interest are synthesized by standard resin-based chemistry (e.g., FMOC), labeled with biotin 
at the ammo-terminus when indicated, and cleaved from the resin using a halide containing 
acid (e.g., trifluoroacetic acid). The synthetic peptides are then purified by reverse phase 
high performance liquid chromatography (HPLC) and the identity of the peptides are 
confirmed by mass spectrometry. 

Example 8B . Measurement of CLASP-2 peptide binding to PDZ Domain- 
containing proteins 

The binding of a biotinylated carboxyl-terminal peptide to a GST-PDZ fusion 
protein is measured as follows: 

(1) GST fusion protein containing one or more PDZ domain(s) is coated 
onto a protein-binding surface. The protein-binding surface is the surface of a 
polystyrene plate, which in some cases has been pre-treated by coating with 5 |ig/ml of 
goat-anti-GST polyclonal antibody followed by blocking with excess bovine serum 
albumin (BSA). The concentration of GST fusion protein used is 5—10 pg/ml and the 
reaction of the GST fusion protein with the plate is carried out in PBS for 1 - 16 hours at 
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4°C. If not already blocked, the plate is then blocked with BSA (2% in PBS, 2 hours, 
4°C) 

(2) The plate is washed with PBS. 

(3) The biotinylated peptide (generally 0.2—20 (aM) is then added to the 
plate and allowed to react in PBS/2% BSA buffer with the GST fusion protein for 10 
minutes at 4°C followed by 20 minutes at 25°C. In cases where competition between a 
labeled (biotinylated) and unlabeled (non-biotinylated) peptide is performed, the 
unlabeled peptide is added immediately prior to adding the labeled peptide. . 

(4) The plate is washed with PBS. 

(1) 0.5 fig/ml steptavidin-HRP conjugate is added to the plate in PBS/2 % 
BSA buffer and allowed to react for 20 minutes at 4°C. 

(6) The plate is washed 5 X with detergent (tween 20) containing solution. 

(7) The plate is developed by addition of HRP-substrate solution for 20 
minutes at room temperature. 

(8) The reaction of the HRP and its substrate is terminated by addition of 1 
M sulfuric acid. 

(9) The optical density of each well of the plate is read at 450 nm. 

In cases where measurement of the apparent affinity of PDZ-ligand interaction 
is desired, the above procedure is carried out with multiple concentrations of the labeled 
peptide being used in a single experiment. A plot of binding versus peptide, concentration 
added is then fit to the equation: 

Binding [peptide] = Saturation Binding x ([peptide] / ([peptide] + Kd)) 

where "Binding [peptide]" is the binding of a given concentration of peptide to 
the GST-PDZ fusion protein minus binding to the GST alone control, "Kd" is the apparent 
affinity of the binding reaction, and "Saturation Binding" is computed to allow the best fit of 
the data to the above equation. The term apparent affinity is used because the reaction may 
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not reach equilibrium during the duration of the binding reaction in which case the apparent 
affinity would underestimate the actual affinity {i.e., actual Kd < observed Kd). 

EXAMPLE 9 

Expression of human CLASP -2 in activated T-cells 

General experimental design 

The expression profiles of human CLASP-2 in T cells upon T cell activation 
was determined by Northern analysis. Jurkat E6 lymphoblasts were activated by treatment 
with anti-CD28, PMA 3 and Ionomycin. Subsequently, total RNA was extracted from cell 
aliquots harvested at 0, 1, 2, 4, 8, and 14 hours post activation. The RNA concentration of 
each preparation was deteimined by the absorption at 260 nm using a spectrophotometer and 
concentrations of the different RNA preparations were adjusted such that equal quantities of 
each RNA preparation could be subjected to Northern analysis. Even gel loading was 
monitored by ethidium bromide staining of the formaldehyde-agarose gel. Northern 
membranes were hybridized to radioactively labeled probes corresponding to portions of 
human CLASP-2 and human beta-actin. Expression levels of CLASP-2 at different time 
points post T-cell activation are proportional to the radioactive signal generated by 
hybridization by the CLASP-2 specific radioactively labeled probe that remained bound to 
the Northern membrane under stringent washing conditions. The entire experiment was done 
in duplicate. 

Jurkat E6 cell activation 

Jurkat E6 cells were maintained and tested in complete IMDM medium 
supplemented with 2 mM glutamine, 10 mM HEPES, 100 u/mL penicillin, 100 tig/mL 
streptomycin, 0.1 mM nonessential amino acids, 1 mM sodium pyruvate (Gibco/BRL), 50 
pM beta mercaptoethanol (Sigma), and 10% fetal calf serum (Gemini). T cells were 
activated as described per Fraser et al., using 0.1 g/mL mouse anti-human CD28 monoclonal 
antibody (PharMingen International catalog number 33741 A), 50 ng/mL PMA (Sigma), and 
1 |^M ionomycin (Calbiochem). Following incubation at 37°C and 5.0% v/v C0 2 , 0.5 xlO 6 
cells were harvested by centrifugation at 500 x g for 10 minutes (min) at room temperature at 
0, 1, 2, 4, 8 and 14 hours post activation and subjected to RNA extraction. 

For RNA preparation, probe labelling and Northern analysis protocols, see 
methods and procedures described in Example 2 above. The CLASP-2 specific probe 
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encompassing nucleotides 5352 to 5922 was generated by PCR from a plasmid containing 
cloned CLASP-2 cDNA sequences using primers C2S12 and C2AS21. 

Hybridization, Washing, and Exposure 

Blots were washed twice in 2x SSC 0.1% SDS for 10 min each at 60° C and 
5 then twice in 0.2x SSC 0. 1% SDS for 10 min each at 60° C, followed by a 5' wash in 2xSSC 
at 60° C. Exposure to KODAK BIOMAX MS film was carried out at minus 80° C using 
amplifying screens. Typically, exposure times were 10 to 36 hours. Signal intensities on 
Northern membranes were quantified by the use of a phosphor imager system (STORM, 
Molecular Dynamics). Signals were counted in the "volume report" mode. 

10 Results 

CLASP-2 expression levels as determined by Northern analysis (FIG. 14) 
slightly decrease at 1 hour post activation. The maximum decrease of approximately 36 % is 
seen at 2 hours post activation. Expression levels augment again at 4 hours post activation 
but do not attain the level that is seen before activation (0 hours). Intensities of CLASP-2 - 

15 specific signals on the Northern blot were quantified by phosphor imager analysis. 

Rectangles were drawn around the areas of CLASP-2-specific signal and total quantity of 
signal was determined by the "volume report" mode; phosphor imager quantification results 
of two entirely independent experiments are shown in the diagram (green bars corresponds to 
Northern blot shown). The above result suggests, that transcriptional control of CLASP-2 

20 expression and T-cell activation are functionally linked to each other. 

EXAMPLE 10 

Chromosomal location of CLASP-2 and possible disease associations 
CLASP-2 cDNA sequences have been mapped to the genomic clone 
(GI:9926440, GL9988160) by use of sequence homology bioinformatics tools BLAST. 
25 Clone (GL9926440, GL9988 1 60) has previously been mapped to the 

chromosomal location 13ql2-ql3. The literature research reports that the mutations, 
deletions, rearrangements, disomies and/or breakpoints (in general: chromosomal aberations) 
in below listed genes make the genes strong candidates for the onset of the listed 
diseases/disorders. Because the CLASP-2 gene is localized in the chromosome location 
30 13ql2-ql3, abnormal CLASP-2 gene regulation or deletion, rearrangement and/or mutations 
in CLASP-2 locus might be directly or indirectly associated with the onset of the listed 
diseases. Further, CLASP-2 gene can be used as a genetic probe to detect the abnormality in 
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regions of these below listed genes and as a diagnostic marker for the related 
disease/disorders. 


CANDIDA TE 
GENES 

LOCUS 

RELATED 
DISEASE/DISORDERS 

IPF1: Insulin 

promoter 

factorl 

13ql2.1 

MODY4: non insulin-dependent juvenile type. 

ST J Mr * 

Defect in pancreatic islet development and 
insulin transcription. 

BRCA2 

13ql2.3 

BCLL2: B cell lymphoma, deletion 
encompassing BRCA2 causes B cell 
lymphoma. 

BRCA2 is one of the responsible genes for 
DNA repairing in S phase. 


13ql3.1-ql4.3 

Deletion of these locus causes MDS6: Myelo 
dysplastic syndrome type 6 
including AML. 


The present invention is not to be limited in scope by the exemplified 
embodiments which are intended as illustrations of single aspects of the invention, and any 
clones, DNA or amino acid sequences which are functionally equivalent are within the scope 
of the invention. Indeed, various modifications of the invention in addition to those described 
herein will become apparent to those skilled in the art from the foregoing description and 
accompanying drawings. Such modifications are intended to fall within the scope of the 
appended claims. It is also to be understood that all base pair sizes given for nucleotides are 
approximate and are used for purposes of description. 

All publications and patent documents cited above are hereby incorporated by 
reference in their entirety for all purposes to the same extent as if each were so individually 
denoted. 

*** 
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WHAT IS CLAIMED IS: 

1 1 . An isolated CLASP-2 polynucleotide, wherein said polynucleotide is 

2 (a) a polynucleotide that has the sequence of SEQ ID NO: 1 , 3, 5 or 9; or 

3 (b) a polynucleotide that hybridizes under stringent hybridization conditions to 

4 (a) and encodes a polypeptide having the sequence of SEQ ID NO: 2, 4, 6 or 10 or an allelic 

5 variant or homologue of a polypeptide having the sequence of SEQ ID NO: 2, 4, 6 or 10; or 

6 (c) a polynucleotide that hybridizes under stringent hybridization conditions to 

7 (a) and encodes a polypeptide with at 25 contiguous residues of the polypeptide of SEQ ED 

8 NO: 2, 4, 6 or 10; or 

9 (d) a polynucleotide that hybridizes under stringent hybridization conditions to 

10 (a) and has at least 12 contiguous bases identical to or exactly complementary to SEQ ID NO: 

11 1,3, 5or9. 

1 2. The polynucleotide of claim 1 , wherein said polypeptide specifically 

2 binds to a PDZ domain of PSD95, DLG1 or neDLG. 

1 3. The polynucleotide of claim 2, wherein said polypeptide has a binding 

2 affinity of at least 1 0 4 M" 1 for binding PSD95, DLG1 or neDLG. 

1 4. The polynucleotide of claim lthat encodes a polypeptide having the 

2 full-length sequence of SEQ ID NO: 2, 4, 6 or 10. 

1 5. The isolated polynucleotide of claim 1, comprising the cDNA coding 

2 sequence of ATCC Deposit Nos. PTA-1562 and PTA-1563 and PTA-1573. 

1 6. An isolated CLASP-2 polynucleotide comprising a nucleotide 

2 sequence that has at least 90% percent identity to SEQ ID NO: 1, 3, 5 or 9. 

1 7. An isolated polypeptide comprising a nucleotide sequence that has at 

2 least 90% sequence identity to SEQ ID NO: 2, 4, 6 or 10 and is immunologically 

3 crossreactive with SEQ ID NO: 2, 4 5 6 or 10 or shares a biological function with native 

4 CLASP-2. 

1 8. A vector comprising the polynucleotide of claim 1 . 
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1 9. An expression vector comprising the polynucleotide of claim 1 in 

2 which the nucleotide sequence of the polynucleotide is operatively linked with a regulatory 

3 sequence that controls expression of the polynucleotide in a host cell. 

1 10. A host cell comprising the polynucleotide of claim 1, or progeny of the 

2 cell. 

1 1 1 . A host cell comprising the polynucleotide of claim 1 , wherein the 

2 nucleotide sequence of the polynucleotide is operatively linked with a regulatory sequence 

3 that controls expression of the polynucleotide in a host cell, or progeny of the cell. 

1 12. The host cell of claim 1 0 which is a eukaryote. 

1 13. The polynucleotide of claim 1 that is an antisense polynucleotide less 

2 than about 200 bases in length. 

1 14. An antisense oligonucleotide complementary to a messenger RNA 

2 comprising SEQ ID NO: 1, 3, 5 or 9 and encoding CLASP-2, wherein the oligonucleotide 

3 inhibits the expression of CLASP-2. 

1 15. An isolated DNA that encodes a CLASP-2 protein as shown in SEQ ID 

2 NO: 2, 4, 6 or 10. 

1 16. The polynucleotide of claim 1 that is RNA. 

1 17. A method for producing a polypeptide comprising: 

2 (a) culturing the host cell of claim 10 under conditions such that the 

3 polypeptide is expressed; and 

4 (b) recovering the polypeptide from the cultured host cell or its cultured 

5 medium. 

1 18. An isolated polypeptide encoded by a polynucleotide of claim 1 (a) or 

2 (b). 

1 19. The polypeptide of claim 18 that has the amino acid sequence of SEQ 

2 ID NO: 2, 4, 6 or 10, or a fragment thereof. 
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1 20. The isolated polypeptide of claim 18, wherein the polypeptide is cell- 

2 membrane associated. 

1 21. The isolated polypeptide of claim 1 8, wherein the polypeptide is 

2 soluble. 

1 22. The polypeptide of claim 19, wherein the polypeptide is fused with a 

2 heterologous polypeptide. 

1 23. An isolated CLASP-2 protein having the sequence as shown in SEQ 

2 ED NO: 2, 4, 6 or 10. 

1 24. A protein comprising the sequence as -shown- in SEQ. ED. NO: 1 and 

2 variants thereof that are at least 95% identical to SEQ ID. NO: 2 and specifically binds 

3 spectrin. 

1 25. An isolated antibody that specifically binds to a polypeptide having the 

2 amino acid sequence as shown in SEQ ED NO: 2, 4, 6 or 10, or a binding fragment thereof. 

1 26. The antibody of claim 25, that is monoclonal. 

1 27. A hybridoma capable of secreting the antibody of claim 26 

1 28. A method for identifying a compound or agent that binds a CLASP-2 

2 polypeptide comprising: 

3 i) contacting a CLASP-2 polypeptide of claim 19 with the compound or agent 

4 under conditions which allow binding of the compound to the CLASP-2 polypeptide to form 

5 a complex and 

6 ii) detecting the presence of the complex. 

1 29. A method of detecting a CLASP-2 polypeptide in a sample, 

2 comprising: 

3 (a) contacting the sample with an antibody or binding fragment of claim 26 

4 and (b) determining whether a complex has been formed between the antibody and with 

5 CLASP-2 polypeptide. 
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1 30. A method of detecting a CLASP-2 polypeptide in a sample, 

2 comprising: 

3 (a) contacting the sample with a polynucleotide of claim 1 or a polynucleotide 

4 that comprises a sequence of at least 12 nucleotides and is complementary to a contiguous 

5 sequence of the polynucleotide of section (a) of claim 1 , and (b) determining whether a 

6 hybridization complex has been formed. 

1 3 1 . A method of detecting a CLASP-2 nucleotide in a sample, comprising: 

2 (a) using a polynucleotide that comprises a sequence of at least 12 nucleotides 

3 and is complementary to a contiguous sequence of the polynucleotide of section (a) of claim 

4 1, in an amplification process; and 

5 (b) determining whether a specific amplification product has been formed. 

1 32. A pharmaceutical composition comprising a polynucleotide of claim 1 , 

2 a polypeptide of claim 1 8, or an antibody of claim 25 and a pharmaceutically acceptable 

3 carrier. 

1 33. A method of inhibiting an immune response in a subject comprising: 

2 (a) interfering with the expression of a CLASP-2 gene; 

3 (b) interfering with the ability of a CLASP-2 protein to bind to another cell; 

4 (c) interfering with the ability of a CLASP-2 protein to bind to another protein. 
1 34. The method of claim 33, wherein the cell is a T cell or a B cell. 

1 35. The method of claim 33 comprising contacting the cell with an 

2 effective amount of a polypeptide which comprises the amino acid sequence of SEQ ID NO: 

3 2, 4, 6 or 10 or a fragment thereof. 

1 36. A method of inhibiting an immune response in a subject, comprising 

2 administering to the subject a therapeutically effective amount of an antibody which 

3 specifically binds a polypeptide having the sequence of SEQ ID NO: 2, 4, 6 or 10. 
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1 37. A method of preventing or treating a CLASP -2-mediated disease 

2 comprising administering to a subject in need thereof a therapeutically effective amount of a 

3 pharmaceutical composition of claim 32. 

1 38. The method claim 37, wherein the CLASP-2-mediated disease is an 

2 autoimmune disease. 

1 39. A method of treating an autoimmune disease in a subject caused or 

2 exacerbated by increased activity of ThI cells consisting of administering a therapeutically 

3 effective amount of a pharmaceutical composition of claim 32 to the subject. 
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l 

A 


2 

GTT TTA CAC CAT CAC CAA AAC CCA GAA TXT 
val leu his his his gin asn pro glu phe 

62 

CAG CTG CAT GAA AAG CAC CAC CTG TTG CTC 
gin leu his glu lys his his leu leu leu 

122 

AGT AAA GGA AGC ACG AAG AAG AGG GAT GTC 
sex- lys gly ser thr lys lys arg asp val 

182 

CCC CTC CTG AAA GAC GGA AGG GTG GTG ACA 
pro leu leu lys asp gly arg val val thr 

242 

CTT CCT TCG GGC TAT CTT GGC TAC CAA GAG 
leu pro ser gly tyx leu gly tyr gin glu 

302 

ATT AAA TGG GTA GAT GGA GGC AAG CCA CTG 
ile lys trp val acp gly gly lye pro leu 

362 

GTG TAT ACT CAG GAT CAG CAT TTA CAT AAT 
val tyr thr gin asp gin his leu his asn 

422 

GGA GCC CAA GCC TTA GGA AAC GAA CTT GTA 
gly ala gin ala leu gly asn glu leu val 

482 

GGC CAC GTG ATG ATC GCC TTC TTG CCC ACT 
gly his val met ile ala phe leu pro thr 

542 

AGA GCC ACA CAG GAA GAA GTC GCG GTT AAC 
arg ala thr gin glu glu val ala val asn 

602 

CAG TGC CAT GAG GAA GGA TTG GAG AGC CAC 
gin cys his glu glu gly leu glu ser his 

662 

GCT GAG CCA TAT GTT GCC TCT GAA TAC AAG 

ala glu pro tyr val ala ser glu tyr lys 


32 

TAT GAT GAG ATT AAA ATA GAG TTG CCC ACT 
tyr asp glu ile lys ile glu leu pro thr 

92 

ACA TTC TTC CAT GTC AGC TGT GAC AAC TCA 
thr phe phe his val ser cys asp asn ser 

152 

GTT GAA ACC CAA GTT GGC TAC TCC TGG CTT 

val glu thr gin val gly tyr ser trp leu 
212 

AGC GAG CAG CAC ATC CCG GTC TCG GCG AAC 

ser glu gin his ile pro val ser ala asn 
272 

CTT GGG ATG GGC AGG CAT TAT GGT CCG GAA 
leu gly met gly arg his tyr gly pro glu 

332 

CTG AAA ATT TCC ACT CAT CTG GTT TCT ACA 
leu lys ile ser thr his leu val ser thr 

392 

TTT TTC CAG TAC TGT CAG AAA ACC GAA TCT 
phe phe gin tyr cys gin lys thr glu ser 

452 

AAG TAC CTT AAG AGT CTG CAT GCG ATG GAA 
lys tyr leu lys ser leu his ala met glu 

512 

ATC CTA AAC CAG CTG TTC CGA GTC CTC ACC 
ile leu asn gin leu phe arg val leu thr 

572 

GTG ACT CGG GTC ATT ATT CAT GTG GTT GCC 
val thr arg val ile ile hie val val ala 

632 

TTG AGG TCA TAT GTT AAG TAC GCG TAT AAG 
leu arg ser £yr val lys tyr ala tyr lys 

692 

ACA GTG CAT GAA GAA CTG ACC AAA TCC ATG 
thr val his glu glu leu thr lys ser met 
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722 

ACC ACG ATT CTC AAG CCT TCT GCC GAT TTC 
thr tlir ile leu lys pro ser ala asp phe 

7 82 

TGG TTT TTC TTT GAT OTA CTG ATC AAA TCT 
trp plie plie plie asp val leu ile lys ser 

842 (Cadherin Cleavage | 

GTT AAG TTG CTG CGA AAC CAG AGA TTT CCT 
val lys leu leu arg asn gin arg phe pro 

902 

GTA AAT ATG CTG ATG CCA CAC ATC ACT CAG 
val aen met leu met pro his ile tlir gin 

962 

AAC GCG AAT CAT AGC CTT GCT GTC TTC ATC 
asn ala asn his ser leu ala val phe ile 

1022 

TTT GTC TTC AAG CAG ATC AAC AAC TAC ATT 
phe val phe lys gin ile asn asn tyr ile 

1082 

CTC TTT GAA TAC AAG TTT GAA TTT CTC CGT 
leu phe glu tyr lys phe glu phe leu arg 

1142 

TTG AAC TTA CCA ATG CCA TTT GGA AAA GGC 
leu asn leu pro met pro phe gly lys gly 

1202 

GAC TAC TCA TTA ACA GAT GAG TTC TGC AGA 
asp tyr ser leu thr asp glu phe cys arg 

GAG GTG GGG ACA GCC CTC CAG GAG TTC CGG 
glu val gly thr ala leu gin glu phe arg 

1322 

AAG AAC CTG CTG ATA AAG CAT TCT TTT GAT 
lys asn leu leu ile lys his ser phe asp 

1382 

AGG ATA GCC ACC CTC TAC CTG CCT CTG TTT 
arg ile ala thr leu tyr leu pro leu phe 

1442 

AAT GTG AGG GAT GTG TCA CCC TTC CCT GTG 
asn val arg asp val ser pro phe pro val 


752 

CTC ACC AGC AAC AAA CTA CTG AGG TAC TCA 
leu thr ser asn lys leu leu arg tyr ser 

812 

ATG GCT CAG CAT TTG ATA GAG AAC TCC AAA 
met ala gin his leu ile glu aen ser lys 

872 

GCA TCC TAT CAT CAT GGA GCG GAA ACC GTT 
ala ser tyr hie his ala ala glu thr val 

932 

AAG TTT GGA GAT AAT CCA GAG GCA TCT AAG 
lys phe gly asp asn pro glu ala ser lys 

992 

AAG AGA TGT TTC ACC TTC ATG GAC AGG GGC 
lys arg cys phe thr phe met asp arg gly 

1052 

AGC TGT TTT GCT CCT GGA GAC CCA AAG ACC 
eer cys phe ala pro gly asp pro lys thr 

1112 

GTA GTG TGC AAC CAT GAA CAT TAT ATT CCG 
val val cys asn his glu his tyr ile pro 

1172 

AGG ATT CAA AGA TAC CAA GAC CTC CAG CTT 
arg ile gin arg tyr gin asp leu gin leu 

1232 | Cadherin EC 

AAC CAC TTC TTG GTG GGA CTG TTA CTG AGG 
asn his phe leu val gly leu leu leu arg 

1292 

GAG GTC CGT CTG ATC GCC ATC AGT GTG CTC 
glu val arg leu ile ala ile ser val leu 

1352 

GAC AGA TAT GCT TCA AGG AGC CAT CAG GCA 
asp arg tyr ala ser arg ser his gin ala 

1412/471 

GGT CTG CTG ATT GAA AAC GTC CAG CGG ATC 
gly leu leu ile glu asn val gin arg ile 

1472 

AAC GCG GGC ATG ACC GTG AAG GAT GAA TCC 
asn ala gly met thr val lys asp glu eer 
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150* 1532 

CTC GOT CTA CCA GCT GTG AAT CCG CTG GTG ACG CCG CAG AAG GGA AGC ACC CTG GAC AAC 
leu tla lw pro ala val asn pro leu val thr pro gin lys gly ser fchr leu asp aen 

AGC CTC CAC AAC CAC CTG CTG CGC GCC ATC TCC GGC ATT GCT TCT CCA TAT ACA ACC TCA 
ser i«u hi» ly» **P l«u leu gly ala ile eer gly il* ala cer pro tyr thr thr cer 


1622 1652 

ACT CCA AAC ATC AAC AGT GTG AGA AAT GCT GAT TCG AGA GGA TCT CTC ATA AGC ACA GAT 
thr pro **n ile asn ser val arg asn ala asp ser arg gly ser leu ile ser thr asp 


1662 "12 


TCG GOT AAC AGC CTT CCA GAA AGG AAT AGT GAG AAG AGC AAT TCC CTG GAT AAG CAC CAA 
ser gly a«r. ser leu pro glu arg asn ser glu lys ser asn ser leu asp lys his gin 


1742 1772 

CAA ACT AOC ACA TTG GGA AAT TCC GTG GTT CGC TGT GAT AAA CTT GAC CAG TCT GAG ATT 
gin ser ««r thx leu gly asn ser val val arg cys asp lys leu asp gin ser glu ile 

1802 1832 

AAG AGC CTA CTG ATG TGT TTC CTC TAG ATC TTA AAG AGC ATG TCT GAT GAT GCT TTG TTT 
lys cor leu leu mat cys phe leu tyr ile leu lys ser met ser asp asp ala leu phe 

1862 "92 

ACA TAT TGG AAC AAG GCT TCA ACA TCT GAA CTT ATG GAT TTT TTT ACA ATA TCT GAA GTC 
thr tyr trp asn lys ala ser thr ser glu leu met asp phe phe thr ile ser glu val 

1922 " 52 

TGC CTG CAC CAG TTC CAG TAG ATG GGG AAG CGA TAC ATA GCC AGG AAC CAG GAG GGG TTG 
cys leu his gin phe gin tyr met gly lys arg tyr ile ala arg asn gin glu gly leu 

1982 2012 

GGA CCC ATA CTT CAT GAT CGA AAG TCT CAG ACA TTG CCT GTT TCC CGT AAC AGA ACA GGA 
gly pro ile val his asp arg lys ser gin thr leu pro val ser arg asn arg thr gly 

2042 2072 

ATG ATG CAT GCC AGA TTG CAG CAG CTG GGC AGC CTG GAT AAC TCT CTC ACT TTT AAC CAC 
met met his ala arg leu gin gin leu gly ser leu asp asn ser leu thr phe asn his 

2102 2132 

AGC TAT GGC CAC TCG GAC GCA GAT GTT CTG CAC CAG TCA TTA CTT GAA GCC AAC ATT GCT 
ser tyr gly his ser asp ala asp val leu his gin ser leu leu glu ala asn ile ala 

2162 2192 

ACT GAG GTT TGC CTG ACA GCT CTG GAC ACG CTT TCT CTA TTT ACA TTG GCG TTT AAG AAC 
thr glu val cys leu thr ala leu asp thr leu ser leu phe thr leu ala phe lys asn 

2222 2252 

CAG CTC CTG GCC GAC CAT GGA CAT AAT CCT CTC ATG AAA AAA GTT TTT GAT GTC TAC CTG 
gin leu leu ala asp his gly his asn pro leu met lys lys val phe asp val tyr leu 
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22B2 2312 

TGT TTT CTT CAA AAA CAT CAG TCT GAA ACG GCT TTA AAA AAT GTC TTC ACT GCC TTA AGG 
cyB P he leu gin lys his gin ser glu thr ala leu lye asn val phe thr ala leu arg 

2342 2372 

TCC TTA ATT TAT AAG TTT CCC TCA ACA TTC TAT GAA GGG AGA GCG GAC ATG TGT GCG GCT 
ser leu ile tyr lys phe pro ser thr phe tyr glu gly arg ala asp met cys ala ala 

2402 2432 

CTG TGT TAC GAG ATT CTC AAG TGC TGT AAC TCC AAG CTG AGC TCC ATC AGG ACG GAG GCC 
leu cys tyr glu ile leu lys cys cys asn ser lys leu ser ser ile arg thr glu ala 

21C2 2492 

TCC CAG CTG CTC TAC TTC CTG ATG AGG AAC AAC TTT GAT TAC ACT GGA AAG AAG TCC TTT 
ser gin leu leu tyr phe leu met arg asn asn phe asp tyr thr gly lys lys ser phe 

2522 2552 

CTC CGG ACA CAT TTG CAA GTC ATC ATA TCT GTC AGC CAG CTG ATA GCA GAC GTT GTT GGC 
vol arg thr his leu gin val ile ile eer val ser gin leu ile ala asp val val gly 

2582 2612 

ATT GGG GAA ACC AGA TTC CAG CAG TCC CTG TCC ATC ATC AAC AAC TGT GCC AAC AGT GAC 
ile gly glu thr arg phe gin gin ser leu ser ile ile asn asn cys ala asn ser asp 

2642 2672 

CGG CTT ATT AAG CAC ACC AGC TTC TCC TCT GAT GTG AAG GAC TTA ACC AAA AGG ATA CGC 
arg leu ile lys his thr ser phe ser eer asp val lys asp leu thr lye arg ile arg 

2702 2732 

ACC GTG CTA ATG GCC ACC GCC CAG ATG AAG GAG CAT GAG AAC GAC CCA GAG ATG CTG GTG 
thr val leu met ala thr ala gin met lys glu his glu asn asp pro glu met leu val 

2762 2792 

GAC CTC CAG TAC AGC CTG GCC AAA TCC TAT GCC AGC ACG CCC GAG CTC AGG AAG ACG TGG 
asp leu gin tyr ser leu ala lys ser tyr ala ser thr pro glu leu arg lys thr trp 

2322 2852 | xxxxxxxxxxxxxxx Predicted 

CTC GAC AGC ATG GCC AGG ATC CAT GTC AAA AAT GGC GAT CTC TCA GAG GCA GCA ATG TGC 
leu asp ser met ala arg ile his val lys asn gly asp leu ser glu ala ala met cys 

Transmembrane Domain xxx:acx}oaaaaDOcaucxxxx3aoanot | 

TAT GTC CAC GTA ACA GCC CTA GTG GCA GAA TAT CTC ACA CGG AAA GGC GTG TTT AGA CAA 
tyr val his val thr ala leu val ala glu tyr leu thx arg lys gly val phe arg gin 

2942 2972 

GGA TGC ACC GCC TTC AGG GTC ATT ACC CCA AAC ATC GAC GAG GAG GCC TCC ATG ATG GAA 
gly cys thr ala phe arg val ile thr pro asn ile asp glu glu ala ser met met glu 

3002 3032 

GAC GTG GGG ATG CAG GAT GTC CAT TTC AAC GAG GAT GTG CTG ATG GAG CTC CTT GAG CAG 
asp val gly met gin asp val his phe asn glu asp val leu met glu leu leu glu gin 

3062 3092 

TGC GCA GAT GGA CTC TGG AAA GCC GAG CGC TAC GAG CTC ATC GCC GAC ATC TAC AAA CTT 
cys ala asp gly leu trp lye ala glu arg tyr glu leu ile ala asp ile tyr lys leu 
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3122 

ATC ATC CCC ATT TAT GAG AAG CGG AGG GAT 
ile ile pro lie tyr glu lys arg arg asp 

3182 

ATT TAG AAG GAA CCC AAA CTC ACA CCG CTG 
ile tyr lys glu pro lys leu thr pro leu 

3242 

TAC TCG GAT AAA TTT GGT TCT GAA AAT GTC 
tyr ser asp lys plie gly ser glu asn val 

3302 

CCT AAG GAT CTG GAT TCT AAG TAT GCA TAC 
pro lys asp leu asp ser lys tyr ala tyr 

3362 

GAC GAA AAA GAG TTG GAA GAA AGG AAA ACA 

asp glu lys glu leu gin glu arg lys thr 
3422 

TTC ATG TTT GAG ATG CCA TTT ACG CAG ACC 
ph.* met phe glu met pro phe thr gin thr 

3482 

TGC AAA CGG CGC ACC ATC CTG ACA GCC ATA 
cys lys arg arg ttoc ile leu thr ala ile 

3542 

CCT GTC ATG TAC CAG CAC CAC ACT GAC CTG 
pro val met tyr gin hiB his thr asp leu 

3602 xxx Coiled-Coil 1 xxxxxxxxxxxxx 
AGT AAG AAG GTG GCG GAG CTC CGG CAG CTG 
ser lys lys val ala glu leu arg gin leu 

3662 acxxxxxxxxxxxxxxxxxxxxx | 
CTG CAG CTC AAA CTC CAG GGC AGC GTG AGT 
leu gin leu lys leu gin gly ser val ser 

3722 

GCG CGA GCT TTC TTA GAT GAT ACA AAC ACA 
ala arg ala phe leu asp asp thr asn thr 

37B2 

CTT AAG GAA GTT TTC AGG CAA TTT GTG GAA 
leu lys glu val phe arg gin phe val glu 


3155 

TTC TTT GAA GAT GAA GAT GGA AAG GAG TAT 
phe phe glu asp glu asp gly lys glu tyr 

3212 

TCG GAA ATT TCT CAG AGA CTC CTT AAA CTG 
s«r glu ile ser gin arg leu leu lys leu 

3272 

AAA ATG ATA CAG GAT TCT GGC AAG GTC AAC 
lys met ile gin asp ser gly lys val asn 

3332 

ATC CAG GTG ACT CAC GTC ATC CCC TTC TTT 
ile gin val thr his val ile pro phe phe 

3392 

GAG TTT GAG AGA TCC CAC AAC ATC CGC CGC 
glu phe glu arg ser his asn ile arg arg 

3452 

GGG AAG AGG CAG GGC GGG GTG GAA GAG CAG 
gly lys arg gin gly gly val glu glu gin 

3512 

CAC TGC TTC CCT TAT GTG AAG AAG CGC ATC 
his cys phe pro tyr val lye lys arg ile 

3572 jxxxxxxxxxxxxxxxxxxscxaaaacacxxxxxx 
AAC CCC ATC GAG GTG GCC ATT GAC GAG ATG 
asn pro ile glu val ala ile asp glu met 

3632 xxxx Coilea-Coil 1 xxxxoocxsaacxx 
TGC TCC TCG GCC GAG GTG GAC ATG ATC AAA 
cys ser ser ala glu val asp met ile lys 

3 692 

GTT CAG GTC AAT GCT GGC CCA CTA GCA TAT 
val gin val asn ala gly pro leu ala tyr 


AAG CGA TAT CCT GAC AAT AAA GTG AAG CTG 
lys arg tyr pro asp asn lye val lys leu 

GCT TGC GGT CAA GCC TTA GCG GTA AAC GAA 
ala cys gly gin ala leu ala val asn glu 
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3842 xxx Coiled-Coil 2 xxxxxxxxxxxxx 3872 xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 
CGT CTG ATT AAA GAA GAG CAG CTC GAG TAT CAG GAA GAA ATG AAA GCC AAC TAC AGG GAA 
arg leu ilo lys glu asp gin leu glu tyr gin glu glu met lye ala asn tyr arg glu 

3902 xxx Coiled-Coil 2 xxxxxxxxxxxxx 3932 xxxxx | 

ATG GCG AAG GAG CTT TCT GAA ATC ATG CAT GAG CAG ATC TGC CCC CTG GAG GAG AAG ACG 
mat ala lys glu leu ser glu ile met his glu gin ile cys pro leu glu glu lys thr 

3962 3992 

AGC GTC TTA CCG AAT TCC CTT CAC ATC TTC AAC GCC ATC AGT GGG ACT CCA ACA AGC ACA 
ser val leu pro asn ser leu his ile phe asn ala ile ser gly thr pro thr ser thr 

4022 J kxxx PBM xxxxj 

ATG GTT CAC GGG ATG ACC AGC TCG TCT TCG GTC GTG TGA TTA CAT CTC ATG GCC CGT GTG 
met val his gly met thr ser ser ser ser val val STP 

4082 4112 

TGG GGA CTT GCT TTG TCA TTT GCA AAC TCA GGA TGC TTT CCA AAG CCA ATC ACT GGG GAG 
4142 4172 

ACC GAG CAC AGG GAG GAC CAA GGG GAA GGG GAG AGA AAG GAA ATA AAG AAC AAC GTT ATT 
4202 4232 

TCT TAA CAG ACT TTC TAT AGG AGT TGT AAG AAG GTG CAC ATA TTT TTT TAA ATC TCA CTG 
4262 4292 

GCA ATA TTC AAA GTT TTC ATT GTG TCT TAA CAA AGG TGT GGT AGA CAC TCT TGA GCT GGA 
4322 4352 

CTT AGA TTT TAT TCT TCC TTG CAG AGT AGT GTT AGA ATA GAT GGC CTA CAG AAA AAA AAG 
4382 4412 

GTT CTG GGA TCT ACA TGG CAG GGA GGG CTG CAC TGA CAT TGA TGC CTG GGG GAC CTT TTG 
4442 4472 

CCT CGA CTC GTG CCG GAA ATC TGA TCG TAA TCA GGG TAC AGA ACT TAC TAG TTT TGT CTA 
4502 4532 

GGA GTA TGT TGT ATG ACT AGG ATT TGT GCT ATT ATC TCA TTC AAC AAC ATA GAG CAA GAA 
4562 4592 

TAG TGA GCT AAC TGA GCT AGA CAC TCA ATT AAT CCG CTA CTG GCT TCA AGT CAG AAC TTT 
4622 4652 

GTC ATT AAT CAT CGA CTC CGG GAC GGT CAT ATA TGT ATT ACA TTT CTA CAT TTT TAA TAC 
4682 4712 

TCA CAT GGG CTT ATG CAT TAA GTT TAA TTG TGA TAA ATT TGT GCT GGT CCA GTA TAT GCA 
4742 4772 

ATA CAC TTT AAT CCT TTA TTC TTG TCA TAA AAA TGT GCA ATA TGG AGA TGT ATA CAA GTC 


4802 
TTT ACT 
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2 

GTT TTA CAC CAT CAC CAA AAC CCA GAA TTT 
val leu his his his gin asn pro glu phe 

62 

CAG CTG CAT GAA AAG CAC CAC CTG TTG CTC 
gin leu his glu lys his his leu leu leu 

122 

AGT AAA GGA AGC ACG AAG AAG AGG GAT GTC 
ser lys gly ser thr lys lys arg asp val 

182 

CCC CTC CTG AAA GAC GGA AGG GTG GTG ACA 
pro leu leu lys asp gly arg val val thr 

242 

OTT CCT TCG GGC TAT CTT GGC TAC CAA GAG 
leu pro ser gly tyr leu gly tyr gin glu 

302 

ATT AAA TGG GTA GAT GGA GGC AAG CCA CTG 
ile lys trp val asp gly gly lys pro leu 

362 

GTG TAT ACT CAG GAT CAG CAT TTA CAT AAT 
val tyr thr gin asp gin his leu his asn 

422 

GGA GCC CAA GCC TTA GGA AAC GAA CTT GTA 
gly ala gin ala leu gly asn glu leu val 

482 

GGC CAC GTG ATG ATC GCC TTC TTG CCC ACT 
gly hie val met ile ala phe leu pro thr 

542 

AGA GCC ACA CAG GAA GAA GTC GCG GTT AAC 
arg ala thr gin glu glu val ala val asn 

602 

CAG TGC CAT GAG GAA GGA TTG GAG AGC CAC 
gin cys his glu glu gly leu glu ser his 

662 

GCT GAG CCA TAT GTT GCC TCT GAA TAC AAG 
ala glu pro tyr val ala ser glu tyr lys 


1 

A 

32 

TAT GAT GAG ATT AAA ATA GAG TTG CCC ACT 
tyr asp glu ile lys ile glu leu pro thr 

92 

ACA TTC TTC CAT GTC AGC TGT GAC AAC TCA 
thr phe phe his val ser cys asp asn ser 

152 

GTT GAA ACC CAA GTT GGC TAC TCC TGG CTT 
val glu thr gin val gly tyr ser trp leu 

212 

AGC GAG CAG CAC ATC CCG GTC TCG GCG AAC 
ser glu gin his ile pro val ser ala asn 

272 

CTT GGG ATG GGC AGG CAT TAT GGT CCG GAA 
leu gly met gly arg his tyr gly pro glu 

332 

CTG AAA ATT TCC ACT CAT CTG GTT TCT ACA 
leu lys ile ser thr his leu val ser thr 

392 

TTT TTC CAG TAC TGT CAG AAA ACC GAA TCT 
phe phe gin tyr cys gin lys thr glu ser 

452 

AAG TAC CTT AAG AGT CTG CAT GCG ATG GAA 
lys tyr leu lys ser leu his ala met glu 

512 

ATC CTA AAC CAG CTG TTC CGA GTC CTC ACC 
ile leu asn gin leu phe arg val leu thr 

572 

GTG ACT CGG GTC ATT ATT CAT GTG GTT GCC 
val thr arg val ile ile his val val ala 

632 

TTG AGG TCA TAT GTT AAG TAC GCG TAT AAG 
leu arg ser tyr val lys tyr ala tyr lys 

692 

ACA GTG CAT GAA GAA CTG ACC AAA TCC ATG 
thr val his glu glu leu thr lys ser met 
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722 V=>^s 

ACC ACG ATT CTC AAG CCT TCT GCC GAT TTC CTC ACC AGC AAC AAA CTA CTG AGG TAG TCA 
fchx thr ile leu Lye pro ser ala asp phe leu tin: ser asn lys leu leu arg tyr ser 

782 812 

TGG TTT TTC TTT GAT GTA CTG ATC AAA TCT ATG GCT GAG CAT TTG ATA GAG AAC TCC AAA 
trp phe phe phe asp val leu ile lys ser met ala gin bis leu ile glu asn ser lys 

842 |Cadharin Cleavage) 872 

GTT AAG TTG CTG CGA AAC CAG AGA TTT CCT OCA TCC TAT CAT CAT OCA GCG GAA ACC GTT 
val lys leu leu arg asn gin arg phe pro ala ser tyr his his ala ala glu thr val 

902 932 

GTA AAT ATG CTG ATG CCA CAC ATC ACT CAG AAG TTT GGA GAT AAT CCA GAG GCA TCT AAG 
val asn met leu met pro his ile thr gin lys phe gly asp asn pro glu ala ser lys 

362 952 

AAC GCG AAT CAT AGC CTT GCT GTC TTC ATC AAG AGA TGT TTC ACC TTC ATG GAC AGG GGC 
asn ala asn his ser leu ala val phe ile lys arg eye phe thr pha met asp arg gly 

1022 - 1052 

TTT GTC TTC AAG CAG ATC AAC AAC TAC ATT AGC TGT TTT GCT CCT GGA GAC CCA AAG ACC 
phe val phe lys gin ile asn asn tyr ile ser cys phe ala pro gly asp pro lys thr 

1082 1H2 

CTC TTT GAA TAC AAG TTT GAA TTT CTC CGT GTA GTG TGC AAC CAT GAA CAT TAT ATT CCG 
leu ph© glu tyr lye phe glu phe leu arg val val cys asn his glu hie tyr ile pro 

1142 H72 

TTG AAC TTA CCA ATG CCA TTT GGA AAA GGC AGG ATT CAA AGA TAC CAA GAC CTC CAG CTT 
leu asn leu pro met pro phe gly lys gly arg ile gin arg tyr gin asp leu gin leu 

1202 1232 | Cadherin EC 

GAC TAC TCA TTA ACA GAT GAG TTC TGC AGA AAC CAC TTC TTG GTG GGA CTG TTA CTG AGG 
asp tyr ser leu thr asp glu phe cys arg asn his phe leu val gly leu leu leu arg 

yyy j 1292 

GAG GTG GGG ACA GCC CTC CAG GAG TTC CGG GAG GTC CGT CTG ATC GCC ATC AGT GTG CTC 
glu val gly thr ala leu gin glu phe arg glu val arg leu ile ala ile ser val leu 

1322 1352 

AAG AAC CTG CTG ATA AAG CAT TCT TTT GAT GAC AGA TAT GCT TCA AGG AGC CAT CAG GCA 
lys asn leu leu ile lys his ser phe asp asp arg tyr ala ser arg ser his gin ala 

1382 1*12 

AGG ATA GCC ACC CTC TAC CTG CCT CTG TTT GGT CTG CTG ATT GAA AAC GTC CAG CGG ATC 
arg ile ala thr leu tyr leu pro leu phe gly leu leu ile glu asn val gin arg ile 

1442 i472 

AAT GTG AGG GAT GTG TCA CCC TTC CCT GTG AAC GCG GGC ATG ACC GTG AAG GAT GAA TCC 
asn val arg asp val ser pro phe pro val asn ala gly met thr val lys asp glu ser 
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1502 1532 

CTG GCT CTA CCA GCT GTG AAT CCG CTG GTG ACG CCG GAG AAG GGA AGC ACC CTG GAC AAC 
leu ala leu pro ala val asn pro leu val thr pro gin lys gly ser thr leu asp asn 

1562 1592 

AGC CTG CAC AAG GAC CTG CTG GGC GCC ATC TCC GGC ATT GCT TCT CCA TAT ACA ACC TCA 
ser leu his lys asp leu leu gly ala ile ser gly ile ala ser pro tyr thr thr ser 

1622 1652 

ACT CCA AAC ATC AAC AGT GTG AGA AAT GCT GAT TCG AGA GGA TCT CTC ATA AGC ACA GAT 
thr pro asn ile asn ser val arg asn ala asp ser arg gly ser leu ile ser thr asp 

1682 1712 

TCG GGT AAC AGC CTT CCA GAA AGG AAT AGT GAG AAG AGC AAT TCC CTG GAT AAG CAC CAA 
ser gly asn ser leu pro glu arg asn ser glu lys ser asn ser leu asp lys his gin 

1742 1772 

CAA AGT AGC ACA TTG GGA AAT TCC GTG GTT CGC TGT GAT AAA CTT GAC CAG TCT GAG ATT 
gin ser ser thr leu gly asn ser val val arg cys asp lys leu asp gin ser glu ile 

1802 1832 

AAG AGC CTA CTG ATG TGT TTC CTC TAC ATC TTA AAG AGC ATG TCT GAT GAT GCT TTG TTT 
lys ser leu leu met cys phe leu tyr ile leu lys ser met ser asp asp ala leu phe 

1862 1892 

ACA TAT TGG AAC AAG GCT TCA ACA TCT GAA CTT ATG GAT TTT TTT ACA ATA TCT GAA GTC 
thr tyr trp asn lys ala ser thr ser glu leu met asp phe phe thr ile ser glu val 

1922 1952 | xxxxxxxxxxxxiaaaaaaac 

TGC CTG CAC CAG TTC CAG TAC ATG GGG AAG CGA TAC ATA GCC AGG AAC CAG GAG GGG TTG 
cys leu his gin phe gin tyr met gly lys arg tyr ile ala arg asn gin glu gly leu 

1982 xxxxxxxxxx deleted in CI*ASP-2D(KIAA1058) xacxxxxxjoaatxxxaQoacxaocac | 

GGA CCC ATA GTT CAT GAT CGA AAG TCT CAG ACA TTG CCT GTT TCC CGT AAC AGA ACA GGA 

gly pro ile val his asp arg lys ser gin thr leu pro val ser arg asn arg thr gly 

2042 2072 

ATG ATG CAT GCC AGA TTG CAG CAG CTG GGC AGC CTG GAT AAC TCT CTC ACT TTT AAC CAC 
met met his ala arg leu gin gin leu gly ser leu asp asn ser leu thr phe asn his 

2102 2132 

AGC TAT GGC CAC TCG GAC GGA GAT GTT CTG CAC CAG TCA TTA CTT GAA GCC AAC ATT GCT 
ser tyr gly his ser asp ala asp val leu his gin ser leu leu glu ala asn ile ala 

Deleted 

2162 2192 \xxx 

ACT GAG GTT TGC CTG ACA GCT CTG GAC ACG CTT TCT CTA TTT ACA TTG GCG TTT AAG AAC 
thr glu val cys leu thr ala leu asp thx leu ser leu phe thr leu ala phe lys asn 

in HC2B 

xxx I 2252 

CAC CTC CTG GCC GAC CAT GGA CAT AAT CCT CTC ATG AAA AAA GTT TTT GAT GTC TAC CTG 
gin leu leu ala asp his gly his asn pro leu met lys lys val phe asp val tyr leu 
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2282 2312 

TGT TTT CTT CAA AAA CAT CAG TCT GAA ACG GCT TTA AAA AAT GTC TTC ACT GCC TTA AGG 
cys phe leu gin lye his gin ser glu thr ala leu lys asn val phe thr ala leu arg 

2342 2372 

TCC TTA ATT TAT AAG TTT CCC TCA ACA TTC TAT GAA GGG AGA GCG GAC ATG TGT GCG GCT 
ser leu lie tyr lye phe pro ser thr phe tyr glu gly arg ala asp met cys ala ala 

2402 2432 

CTG TGT TAG GAG ATT CTC AAG TGC TGT AAC TCC AAG CTG AGC TCC ATC AGG ACG GAG GCC 
leu cys tyr glu ile leu lys cys cys asn ser lys leu ser ser ile arg thr glu ala 

2462 2492 

TCC CAG CTG CTC TAC TTC CTG ATG AGG AAC AAC TTT GAT TAC ACT GGA AAG AAG TCC TTT 
ser gin leu leu tyr phe leu met arg asn asn phe asp tyr thr gly lys lys ser phe 

2522 2552 

GTC CGG ACA CAT TTO CAA GTC ATC ATA TCT GTC AGC CAG CTG ATA GCA GAC GTT GTT GGC 
val arg thr his leu gin val ile ile ser val sex gin leu ile ala asp val val gly 

2 612 

ATT 2 GGG GAA ACC AGA TTC CAG CAG TCC CTG TCC ATC ATC AAC AAC TGT GCC AAC AGT GAC 
ile gly glu thr arg phe gin gin ser leu ser ile ile asn asn cys ala asn ser asp 

2642 2672 

CGG CTT ATT AAG CAC ACC AGC TTC TCC TCT GAT GTG AAG GAC TTA ACC AAA AGG ATA CGC 
arg leu ile lye his thr ser phe ser ser asp val lys asp leu thr lys arg ile arg 

2702 2732 

ACG GTG CTA ATG GCC ACC GCC CAG ATG AAG GAG CAT GAG AAC GAC CCA GAG ATG CTG GTG 
thr val leu met ala thr ala gin met lys glu his glu asn asp pro glu met leu val 

2762 2792 

GAC CTC CAG TAC AGC CTG GCC AAA TCC TAT GCC AGC ACG CCC GAG CTC AGG AAG ACG TGG 
asp leu gin tyr ser leu ala lys ser tyr ala ser thr pro glu leu arg lys thr trp 

2822 2852 |x3aaaaaoaaix3aao: Predicted 

CTC GAC AGC ATG GCC AGG ATC CAT GTC AAA AAT GGC GAT CTC TCA GAG GCA GCA ATG TGC 
leu asp ser met ala arg ile his val lys asn gly asp leu ser glu ala ala met cys 

(Additional and differential exon usage found at position 2927 consisting 
of 69 nucleotides. This entire sequence is found in Human CLASP -2D 
(KXAA105 8) and not other isoforms of CLASP -2. It has a sequence of: 
AAGCAGTCCAGTGGGAGCCGCCCCTTCTCCCCC^C^GCCATAGCGCCTGCCTGAGGAGGAGCCGGGrGAGj 

Transmembrane Domain xxxxxx>oaooaoooaauaaaaaoaat | jf 

TAT GTC CAC GTA ACA GCC CTA GTG GCA GAA TAT CTC ACA CGG AAA & GTG TTT AGA CAA 
tyr val his val thr ala leu val ala glu tyr leu thr arg lys gly val phe arg gin 

2972 

GGA TGC ACC GCC TTC AGG GTC ATT ACC CCA AAC ATC GAC GAG GAG GCC TCC ATG ATG GAA 
gly cys thr ala phe arg val ile thr pro asn ile asp glu glu ala ser met met glu 
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3002 | iDauaoaauaaacxxxxxxxicKxxxxxxxxxxx Sequence deleted in CLASP-2E xxxxx 

GAC GTG GGG ATG CAG GAT GTC CAT TTC AAC GAG GAT GTG CTG ATG GAG CTC CTT GAG CAG 
asp val gly met gin asp val his phe asn glu asp val leu met glu leu leu glu gin 

3062 iDuoaQacaDaoanaacxx | 3092 

TGC GCA GAT GGA CTC TGG AAA GCC GAG CGC TAC GAG CTC ATC GCC GAC ATC TAC AAA CTT 
cys ala asp gly leu trp lys ala glu arg tyr glu leu ile ala asp ile tyr lys leu 

[Additional and differential exon usage found at position 3153. The 
entire sequence below is found in Human CLASP-2D. Underlined sequence is 
found in Human CEASF-2B, 2 c and 2E. 

TGAGAGGC TGGC C CATC TGTATGACACGCTGCACCGGQC CT ACAGCAAAGTG ACC GAGGTCAT 
GCACTCGGCCCGCAGGCTTCTGGGGACCTACTTCCGGGTAGCCTTCTTCGGGCAGGC AGCGCAATACCAGTTT 

ACAGACAGTGAAACAGATGTGGAGGGATT] 

3122 3155 
ATC ATC CCC ATT TAT GAG AAG CGG AGG GAT TTC TTT GAA GAT GAA GAT GGA AAG GAG TAT 
ile ile pro ile tyr glu lys arg arg asp phe phe glu asp glu asp gly lys glu tyr 

3182 3212 

ATT TAC AAG GAA CCC AAA CTC ACA CCG CTG TCG GAA ATT TCT CAG AGA CTC CTT AAA CTG 
ile tyr lys glu pro lys leu thr pro leu ser glu ile ser gin arg leu leu lys lou 

3242 3272 

TAC TCG GAT AAA TTT GGT TCT GAA AAT GTC AAA ATG ATA CAG GAT TCT GGC AAG GTC AAC 
tyr ser asp lys phe gly ser glu asn val lys met ile gin asp ser gly lys val asn 

3302 3332 

CCT AAG GAT CTG GAT TCT AAG TAT GCA TAC ATC CAG GTG ACT CAC GTC ATC CCC TTC TTT 
pro lys asp leu asp ser lys tyr ala tyr ile gin val thr his val ile pro phe phe 

3362 3392 

GAC GAA AAA GAG TTG CAA GAA AGG AAA ACA GAG TTT GAG AGA TCC CAC AAC ATC CGC CGC 
asp glu lys glu leu gin glu arg lys thr glu phe glu arg ser his asn ile arg arg 

3422 3452 

TTC ATG TTT GAG ATG CCA TTT ACG CAG ACC GGG AAG AGG CAG GGC GGG GTG GAA GAG CAG 
phe met phe glu mot pro phe thr gin thr gly lys arg gin gly gly val glu glu gin 

3482 3512 

TGC AAA CGG CGC ACC ATC CTG ACA GCC ATA CAC TGC TTC CCT TAT GTG AAG AAG CGC ATC 
cys lys arg arg thr ile leu thr ala ile his cys phe pro tyr val lys lys arg ile 

Two nucleotide deletion (nts 3586 and 3587) found in Human CLASP-2C 

i 

3542 3572 \xxx\ 

CCT GTC ATG TAC CAG CAC CAC ACT GAC CTG AAC CCC ATC GAG GTG GCC ATT GAC GAG ATG 
pro val met tyr gin his his thr asp leu asn pro ile glu val ala ile asp glu met 
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3602 3632 

AGT AAG AAG GTG GCG GAG CTC CGG CAG CTG TGC TCC TCG GCC GAG GTG GAC ATG ATC AAA 
ser lys lys val ala glu leu arg gin leu cys ser eer ala glu val asp met ile lys 

2 3692 
CTG CAG CTC AAA CTC CAG GGC AGC GTG AGT GTT CAG GTC AAT OCT GGC CCA CTA GCA TAT 
leu gin leu lys leu gin gly ser val ser val gin val asn ala gly pro leu ala tyr 

2 3752 
GCG CGA GCT TTC TTA GAT GAT ACA AAC ACA AAG CGA TAT CCT GAC AAT AAA GTG AAG CTG 
ala arg ala phe leu asp asp thr asn thr lys arg tyr pro asp asn lys val lys leu 

3782 3812 

CTT AAG GAA GTT TTC AGG CAA TTT GTG GAA GCT TGC GGT CAA GCC TTA GCG GTA AAC GAA 
leu lys glu val phe arg gin phe val glu ala cys gly gin ala leu ala val asn glu 

3842 3872 

CGT CTG ATT AAA GAA GAC CAG CTC GAG TAT CAG GAA GAA ATG AAA GCC AAC TAC AGG GAA 
arg leu ile lys glu asp gin leu glu tyr gin glu glu met lys ala asn tyr arg glu 

insertion of 8 nucleotides found only in Human CIAS P- 2D with sequence: CTGGGATG 


I 


3902 3932 

ATG GCG AAG GAG CTT TCT GAA ATC ATG CAT GAG CAG \TC TGC CCC CTG GAG GAG AAG ACG 
met ala lys glu leu ser glu ile met his glu gin ile cys pro leu glu glu lys thr 

3962 3992 

AGC GTC TTA CCG AAT TCC CTT CAC ATC TTC AAC GCC ATC AGT GGG ACT CCA ACA AGC ACA 
ser val leu pro asn ser leu his ile phe asn ala ile ser gly thr pro thr ser thr 

4022 |"xacxx PBM sexxx| 

ATG GTT CAC GGG ATG ACC AGC TCG TCT TCG GTC GTG TGA TTA CAT CTC ATG GCC CGT GTG 
met val his gly met thr ser ser ser ser val val STP 

4082 4112 

TGG GGA CTT GCT TTG TCA TTT GCA AAC TCA GGA TGC TTT CCA AAG CCA ATC ACT GGG GAG 
4142 4172 

ACC GAG CAC AGG GAG GAC CAA GGG GAA GGG GAG AGA AAG GAA ATA AAG AAC AAC GTT ATT 
4202 4232 

TCT TAA CAG ACT TTC TAT AGG AGT TGT AAG AAG GTG CAC ATA TTT TTT TAA ATC TCA CTG 
4262 4292 

GCA ATA TTC AAA GTT TTC ATT GTG TCT TAA CAA AGG TGT GGT AGA CAC TCT TGA GCT GGA 
4322 4352 

CTT AGA TTT TAT TCT TCC TTG CAG AGT AGT GTT AGA ATA GAT GGC CTA CAG AAA AAA AAG 
CTT 2 CTG GGA TCT ACA TGG CAG GGA GGG CTG CAC TGA CAT TGA TGC CTG GGG GAC CTT TTG 
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4442 

OCT CCA CTC GTG 
4*02 

GGA OTA TOT TOT 
4542 

TAG TGA OCT AAC 
4622 

GTC ATT AAT CAT 

A jC O -> 

T» w w A 

TCA CAT CCC CTT 
4742 

ATA CAC TTT AAT 

4802 
TTT ACT 


CCG GAA ATC TGA 
ATG ACT AGG ATT 
TGA CCT AGA CAC 
CGA CTC CGG GAC 
ATC CAT TAA GTT 
GGT TTA TTC TTG 


4472 
TCG TAA TCA GGG 

4532 
TGT GCT ATT ATC 

4592 
TCA ATT AAT CCG 

4652 
GGT CAT ATA TGT 

4712 
TAA TTG TGA TAA 

4772 
TCA TAA AAA TGT 


TAC AGA ACT TAC 
TCA TTC AAC AAC 
CTA CTG GCT TCA 
ATT ACA TTT CTA 
ATT TGT GCT GGT 
GCA ATA TGG AGA 


TAG TTT TGT CTA 
ATA GAG CAA GAA 
AGT CAG AAC TTT 
CAT TTT TAA TAC 
CCA GTA TAT GCA 
TGT ATA CAA GTC 
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HC2A 

HC2-80 

HC2B 

HC2C _ — — ~~ 

HC2D-KIAA1058 GCATCTGGAAATCTTGACAAAAATGCCAGATTTTCTGCCATCTACAGGCAAGACAGCAAT 

HC2E 

HC2F 


HC2A ~~ 

HC2-80 

HC2B 

HC2C ——————————————————— — ——————— __—.———.—————— ————————————————————— 

HC2D-KIAA1058 AAGCTATCCAATGATGACATGCTCAAGTTACTTGCAGACTTTCGGAAACCTGAGAAGATG 

HC2E 

HC2F 


HC2A 

HC2-80 

HC2B 

HC2C ~ ——————————————— — — ———————— 

HC2D-KIAA1058 GCTAAGCTCCCAGTGATTTTAGGCAATCTAGACATTACAATTGATAATGTTTCCTCAGAC 

HC2E 

HC2F 


HC2A 

HC2-80 ~" 

HC2B ~ 

HC2C ~" ——————————— — — 

HC2D-KIAA1058 TTCCCTAATTATGTTAATTCATCATACATTCCCACAAAACAATTTGAAACCTGCAGTAAA 

HC2E ZZZZZ'Z 

HC2F 


HC2A 

HC2-80 " 

HC2B Z~ZZ 

HC2C ———— —— — — — — — *"*" ~ — — — — 

HC2D-K1AA1058 ACTCCCATCACGTTTGAAGTGGAGGAATTTGTGCCCTGCATACCAAAACACACTCAGCCT 

HC2E ~_ ~~ 

HC2F 1 
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HC2A 

HC2-80 

HC2B 

HC2C 

HC2D -KIAA1 058 TAC ACCATC TACACCAAT CACC TT TACGTTTATCC TAAGTAC T TGAAATACGACAGTCAG 

HC2E 

HC2F 


HC2A 

HC2-80 

HC2B 

HC2C " ~ " 

HC2D-KIAA1 058 AAGTC TTT TGCCAAGGC TAGAAATAT TGCGATTTGCATTGAAT TCAAAGAT TCAGATGAG 

HC2E 

HC2F 


HC2A 

HC2-80 

HC2B 

KC2C 

EC2D-KIAA1058 GAAGACTCTCAGCCCCTTAAGTGCATTTATGGCAGACCTGGTI^GCCAGTTTTCACAAGA 

KC2E s 

HC2F 


HC 2A AGT T T T ACAC CAT CAC CAAAACCCAGAAT T T T ATGAT GAGAT T AAA 

HC2-80 

HC2B 

HC2C ' 

HC2D-KIAA1058 AGCGCCTTTGCTGCAGTTTTACACCATCACCAAAACCCAGAATTTTATGATGAGATTAAA 

HC2E 

HC2F 


HC2A ATAGAGTTGCCCACTCAGCTGCATGAAAAGCACCACCTGTrGCTCACATTCTTCCATGTC 

HC2-80 

HC2B 

KC2C — — — — — - — — —————— — — 

HC2D-KIAA1058 ATAGAGTTGCCCACTCAGCTGCATGAAAAGCACCACCTGTTGCTCAC^TTCTTCCATGTC 

HC2E ~~ 

HC2F • 


HC2 A AGCTGTGACAACTCAAGTAAAGGAAGCACGAAGAAGAGGGATGTCGTTGAAACCCAAGTT 

HC2-80 " 

HC2B ~~ 

HC2C — — — — — — — ""*" 

HC2D-KIAA1058 AGC TG T GACAAC T CAAGT AAAGGAAGCACGAAGAAGAGGGAT GT CG T T GAAAC C CAAGT T 

HC2E 

HC2F 
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HC2A GGCTACTCCTGGCTTCCCCTCCTGAAAGACGGAAGGGTGGTGACAAGCGAGCAGCACATC 

HC2-80 "~~ 

HC2B " 

HC2C 

HC2D-KIAA1058 GGCTAGTCCTGGCTTCCCCTCCTGAAAGACGGAAGGGTGGTGACAAGCGAGCAGCACATC 

HC2F "~ ZZZ 

HC2F 

HC2A CCGGTCTCGGCGAACCTTCCTTCGGGCTATCTTGGCTACCAAGAGCTTGGGATGGGCAGG 

HC2 -HO ~ 

HC2B ~~~ """" 

HC2C *~ —————— — — 

HC2D KIXA10LB CCGGTCTCGGCGAACCTTCCTTCGGGCTATCTTGGCTACCAGGAGCTTGGGATGGGCAGG 

KC2£ ™" Z ZZ Z~ 

HC2F 

HC2A cattatggTCCGGAAATTAAATGGGTAGATGGAGGCAAGCCACTGCTGAAAATTTCCACT 

HC2-8G " ~ ZZZZZ 

HC2B _ ~" "~~ 

HC2C — — ~~ 

HC2D-KIAA1C56 CATTATGGTC CGGAAAT TAAAT GGGTAGATGGAGGCAAGCCACTGC TGAAAAT TTCCACT 

HC2E ' II" Z I~~"I IIIII 

HC2F 

HC2A CATC T GGT T TC TACAGTGTATACTCAGGAT CAGCATT TACATAAT T T TT T CCAGTAC TGT 

HC2-80 ~~ 

HC2B ~ ""I™ 

HC2C — — — ~" 

HC2D-KIAA1058 CATCTGGTTTCTACAGTGTATACTCAGGATCAGCATTTACATAATTTTTTCCAGTACTGT 

HC2E IIII""II"""I"""I™I 

HC2F 

HC2A CAGAAAAC CGAATCTGGAGCCCAAGCC T TAGGAAAC GAACTT GTAAAGTACC T TAAGAGT 

HC2-80 ~ H""III"" 

HC2B ~~~~ " ~ 

HC2C 

HC2D-KIAA1 056 CAGAAAAC CGAAT C TGGAGC CCAAGC C T TAGGAAAC GAACT TGT AAAGT ACC T T AAGAG T 

HC2E 7 "~"""I""""""" Z~ 

HC2F 

HC2A CTGCATGCGATGGAAGGCCACGTGATGATCGCCTTCTTGCCCACTATCCTAAACCAGCTG 

ljq2 —80 — — — — ————————————————— — 

HC2B GCGATGGAAGGCCACGTGATGATCGCCTTCTTGCCCACTATCCTAAACCAGCTG 

HC2C — —————— 

HC2D-KI AA1 058 CTGCATGCGATGGAAGGCCACGTGATGATCGCCTTCTTGCCCACTATCCTAAACCAGCTG 

HC2E GCGATGGAAGGCCACGTGATGATCGCCTTCTTGCCCACTATCCTAAACCAGCTG 

HC2F 
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TTCCGAGTCCTCACCAGAGCCACACAGC3AAGAAGTCGCGGTTAACGTGACTCGGGTCATT 


TTCCGAGTCCTCACCAGAGCCACACAGGAAGAAGTCGCGGTTAACGTGACTCGGGTCATT 


TTCCGAGTCCTCACCAGAGCCACACAGGAAGAAGTCGCGGTTAACGTGACTCGGGTCATT 
TTCCGAGTCC TCACCAGAGCCACACAGGAAGAAGTCGCGGTTAACGTGACTCGGGTCAT T 


ATTCATGTGGTTGCCCAGTGCCATGAGGAAGGATTGGAGAGCCACTTGAGGTCATATGTT 


ATTCATGTGGrTGCCCAGTGCCATGAGGAAGGATTGGAGAGCCACTTGAGGTCATATGTT 


ATxCATGTGGTTCCCC^GTGCCATG^GGM^ 

ATTCATGTGGTTGCCCAGTGCCU^TGAGGAAGGATTGGAGAGCCACTTGAGGTCATATGTT 


AAGTACGCGTATAAGGCTGAGCCATATGTTGCCTCTGAATAGAAGACAGTGCATGAAGAA 


AAGTACGCGTATAAGGCTGAGCCATATGTTGCCTCTGAATACAAGACAGTGCATGAAGAA 


AAG T AC GC G T AT AAGGC T GAGC CAT ATG T T GCC T C TGAAT AC AAGACAG T G CAT GAAGAA 
AAGTACGC G TATAAGGC T GAGC CATATGT TGCC TC TGAAT ACAAGACAGTGCATGAAGAA 


CTGACCAAATCCATGACCACGATTCTCAAGCCTTCTGCCGATTTCCTCACCA.GCAACA?^ 


CTGACCAAATCC^TGACCACGATTCTCAAG^CTTCTGCCGATTTC^ 


CTGACCAAATCCATCACCACGATTCTCAAGCCTTCTGCCGATTTCCTCACCAGCAACAAA 
CTGACCAAATCCATGACCACGATTCTCAAGCCTTC^ 


CTACTGAGGTACTCATGGTTTTTCTTTGATGTACrGATCAAATCTATGGCTCAGCATTTG 


CTACTGAGGTACTCATGGTTTTTCTTTGATGTACrGATCAAATCTATGGCTCAGCATTTG 


CIACTGAAGTACTCATGGTTTTTCTTTGATGTACTGATCAAATCTATGGCTCAGCATTTG 
CTACTGAGGTACTCATGGTTTTTCTTTGATGTACTGATCAAATCTATGGCTCAGCATrTG 


ATAGAGAAC T CCAAAGT TAAG T TGC TGCGAAAC CAGAGAT T TCC TGCAT CC TATCATCAT 


AT AGAGAAC TCCAAAG T TAAGTTGC TG CGAAACCAGAGATT TC C T GCATCC TATCATCAT 


ATAGAGAACTCCAAAGTTAAGTTGCTGCGAAACCAGAGATTTCCTGCATCCTATCATCAT 
AT AGAGAAC TCCAAAGTT AAGT TGCTGCGAAACCAGAGAT T TCC TGCATCC TATCATCAT 


HC2F 
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HC2A GCAGC GGAAAC C G T T GTAAATATGC T GAT GCCACACATCAC T CAGAAG T T T GGAGATAAT 

— 80 — — — — _ _ — — — — — — — — — ——————— — — — — — — — — — — — — — — —————— ————— 

HC2 B GCAGC GGAAACC G T T GTAAATAT GC T GAT GC C ACACATCAC T CAGAAG T TT GGAGATAAT 

HC2C — — — — —————————————————— — — 

HC2D-KIAA1058 GCAG T GGAAACC G T TGTAAATATGC T GAT GC CACACATCAC T CAGAAG T T T C GAGATAAT 

HC2E G CAGC G GAAACC G T TGTAAATAT GC T GAT GCCACACATCAC T CAGAAGT T T GGAGATAAT 

KC2F 

HC2A CC^GAGGCATCTAAGAACGCGAATCATAGCCTTGCTGTCTTCATCAAGAGATGTTTCACC 

HC2-80 

HC2B C CAGAGGC ATC T AAGAAC GCGAATCATAGC C TT GC TG T C T T CATCAAGAGAT G T T T CAC C 

HC2"* ——————————— — — — — — — — —— —————————————————————————— 

HC2D-KIAA105 8 C CAGAGGC ATC T AAGAAC GCGAATCATAGC C T T GC TGT C TT CATCAAGAGAT G TT T CAC C 

HC2E CCAGAGGCATC TAAGAACGCGAATGATAGCCT TGCTGTCT TC ATCAAGAGATGTTTCACC 

HC2F 

HC2A T TCATGGACAGGGGCTTTGTC TTCAAGCAGAT CAAGAACTAC ATTAGCT GTTTT GCTCCT 

HC2-80 — — — — — — ______—___.——————————————— ———————— ———————— 

HC2B T XCATGGACAGGGGC TT TGTC TTCAAGCAGATCAACAACTACATTAGCTGTTT TGCTCCT 

HC2C ———————— — — — — — ——————————— — — 

HC2D-KIAA1058 TTCATGGACAGGGGCTTTGTCTTCAAGCAGATCAACAACTACATTAGCTGTTTTGCTCCT 

HC2E TTCATGGACAGGGGCT TTGTC T TCAAGCAGATCAACAACTAC ATTAGCTG T T TTGC TCCT 

HC2F 

HC2A GGAGACCCAAAGACCCTCTTTGAATACAAGTTTGAATTTCTCCGTGTAGTGTGCAACCAT 

JJ£2 _ 6 Q — — —— — —— ————————————————————————— ——— — — 

HC2B GGAGACCCAAAGACCCTCTTTGAATACAAGTTTGAATTTCTCCGTGTAGTGTGCAACCAT 

HC2C "~ ~* ——————— — — 

HC2D-KXAA1058 GGAGACCCAAAGACCCTCTTTGAATACAAGTTTGAATTTCTCCGTGTAGTGTGCAACCAT 

HC2E GGAGACCCAAAGACCCTCTTTGAATACAAGTTTGAATTTCTCCGTGTAGTGTGCAACCAT 

HC2F 

HC2A QAACATTATATTCCGTTGAACTTACCAATGCCAITTGGA^^ 

jj^2 —30 —————————————————————— 

HC2B GAAC^TTATATTCCGTTGAACTTACCAATGCCATTTGGAAAAGGCAGGATTCAAAGATAC 

HC2C — — — — — — — — — — — — — 

HC2D-KIAA1058 (^CATTATATTCCGTTGAACTTACCAATG^CA^ 

HC2E GAACAT T ATATTCCGT TG AACTTACCAATGCCATT TGGAAAAGGCAGGATTCAAAGATAC 

HC2F 

HC2A CAAGACCTCCAGC T TGAC TACT CATTAACAGATGAGTTCT GCAGAAACCAC TT C T TGGTG 

HC2 _ 80 TCCAGCTTGACTACTCATTAACAGATGAGTTCTGCAGAAACCACTTCTTGGTG 

HC2B CAAGACCTCCAGCTTGACTACTCATTAACAGATGAGTTCTGCAGAAACCACTTCTTGGTG 

HC2C ~ — — ~ —————— ——— — — — ———— — ——— — — — — — — — — 

HC2D-KIAA1058 CAAGACCTCCAGCTTGACTACTCATTAACAGATGAGTTCTGCAGAAACCACTTCTTGGTG 
HC2E CAAGACCTCCAGCTTGACTACTCATTAACAGATGAGTTCTGCAGAAACCACTTCTTGGTG 

HC2F 
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GGACTGTTACTGAGGGAGGTGGGGACAGCCCTCCA6GAGTTCCGGGAGGTCCGTCTGATC 
GGA.CTGTTACTGAGGGAGGTGGGGACAGCCCTCCAGGAGTTCCGGGAGGTCCGTCTGATC 
GGACTGTTACTGAGGGAGGTGGGGACAGCCCTCCAGGAGTTCCGGGAGGTCCGTCTGATC 


GGACTGTTACTGAGGGAGGTGGGGACAGCCCTCCAGGAGTTCCGGGAGGTCCGTCTGATC 
GGACTGTTACTGAGGGAGGTGGGGACAGCCCTCCAGGAGTTCCGGGAGGTCCGTCTGATC 


GCCATCAGTGTGCTCAAGAACC TGC TGATAAAGCATTC T TT TGATGACAGATATGC T TCA 
GCCATCAGTGTGC TCAAGAACC TGC TGATAAAGCATTC T TT T GATGACAGATATGC T T CA 
GCCATCAGTGTGCTCAAGAACC TGC TGATAAAGCATTCT TT TGATGACAGATATGC TTCA 


GCCATCAGTGTGCTCAAGAACC TGCTGATAAAGCAr xc T T T 'i'GA'l'CiACACiATATGCTTCA 
GCCATCAGTGTGCTCAAGAACCTX^TGATAAAGCATTCTTTxX^TGACAGATATGCTTCA 


AGGAGCCATCAGGCAAGGATAGCCACCCTCTACCTGCCTCTGTTTGGTCTGCTGATTGAA 
AGGAGCCATCAGGCAAGGATAGCCACCCTCTACCTGCCTCTGTTTGGTCTGCTGATTGAA 
AGGAGCCATCAGGCAAGGATAGCC^CCCTCTACCT<^CTCTGTTTGGTCTGCTGATTGAA 


AGGAGCCATCAGGCAAGGATAGCCACCCTCTACCTGCCTCTGTTTGGTCTGCTGATTGAA 
AGGAGCCATCAGGCAAGGATAGCCACCCTCTACCTGCCTCTGTTTGGTCTGCTGATTGAA 


AACGTCCAGCGGATCAATGTGAGGGATGTGTCACCCTTCCCTGTGAACGCGGGCATGACC 
AACGTCCAGCGGATCAATGTGAGGGAT GTGTCACCC T TC CCTGTGAACGCGGGCATGACC 
AACGTCCAGCGGATCAATGTGAGGGATGTGTCACCCTTCCCTGTGAACGCGGGCATGACC 


AACGTCCAGCGGATCAATGTGAGGGATGTGTCACCCTTCCCTGTGAACGCGGGCATGACT 
AACGTCCAGCGGATCAATGTGAGGGATGTGTCACCCTTCCCTGTGAACGCGGGCATGACC 


GTGAAGGATGAATCCCTGGCTCTACCAGCTGTGAATCCGCTGGTGACGCCGCAGAAGGGA 
GTGAAGGATGAATCCCTGGCTCTACCAGCTGTGAATCCGCTGGTGACGCCGCAGAAGGGA 
GTGAAGGATGAATCCCTGGCTCTACGAGCTGTGAATCCGCTGGTGACGCCGGAGAAGGGA 


GTGAAGGATGAATCCCTGGCTCTACCAGCTGTGAATCCGCTGGTGACGCCGCAGAAGGGA 
GTGAAGGATGAATCCCTGGCTCTACCAGCTGTGAATCCGCTGGTGACGCCGCAGAAGGGA 


AGC ACCC TGGACAACAGC C TGCACAAGGACC TGC TGGGCGCCATCTCC GGCATTGC TTC T 
AGCACCCTGGACAACAGCCTGCACAAGGACCTGCTGGGCGCCATCTCCGGCATTGCTTCT 
AGCACCCTGGACAACAGCCTGCACAAGGACCTGCTGGGCGCCATCTCCGGCATTGCTTCT 


AGCACCCTGGACAACAGCCTGCACAAGGACCTGCTGGGCGCCATCTCCGGCATTGCTTCT 
AGCACCCTGGACAACAGCCTGCACAAGGACC TGC TGGGCGC CATC TCCCGCATTGCTTCT 


HC2F 
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HC2A CCATATACAACCTCAACTCCAAACATCAAiCAGTGTGA.GAAA.TGCrGATTCGAGAGGATCT 

HC2 - 8 0 CGATATACAACCTCAACT CCAAACATCAACAGTGT<^ TCGAGAGGATCT 

HC2B C CA TAT ACAACCT CAAC T C CAAACAT CAACAGT GT GAGAAAT GC TGAT T CGAGAGGATC T 

HC2C 

HC2D-KIAA1058 C CAT AT ACAACC T CAAC TC CAAACAT CAACAGTGT GAGAAAT GC TGAT TCG AGAGGAT C T 

HC2E C CAT AT AC AACC T CAAC T C CAAACAT CAACAGT GT GAGAAAT GC TGAT T C G AGAGGAT C T 

HC2 F GCTGAT TCGAGAGGATCT 

HC2A C TCAT AAGCACAGAT TC GGGTAACAG C C T T C CAGAAAGGAAT AGTGAGAAGAGCAATT C C 

HC2 - 8 0 C TCATAAGCACAGATTCGGGTAACAGC CTTCCAGAAAGGAATAGTGAGAAGAGCAATTCC 

HC2B C TCATAAGCACAGATTCGGGTAACAGC C T TCCAGAAAGGAATAGTGAGAAGAGCAATTCC 

HC2C 

HC2D-KI AA1 059 CTCATAAGCACAGATTCGGGTAACAGCCT TCCAGAAAGGAAT AGT^ 

HC2E CTGATAAGCACAGATTCGGGTAACAGCCT TCCAGAAAGGAATAGTGAGAAGAGCAATTCC 

HC2F C TCATAAGC^CAGATTCGGGTAACAGCCT TC^AGAAAGGAATAGTGAGAAGAGCAATTCC 

HC2A C TGGATAAGCACCAACAAAGTAGCACAT TGGGAAAT TC CGTGGTTC GCTGT GATAAAC TT 

HC2-80 CTGGATAAGCACCAACAAAGTAGCACATTGGGAAATTCCGTG 

HC2B C TGGATAAGCACCAACAAAGTAGCAGAT TGGGAAAT TC C GTGGTTCGCTGTGATAAACTT 

HC2C 

HC2D-KIAA1058 CTGGATAAGCACCAACAAAGTAGCACATTGGGAAATTCCGTGGT^ 

HC2E C TGGATAAGCAC CAACAAAGTAGCACAT T GGGAAAT TC C G T G GTT C GCTGT GAT AAAC TT 

HC2 F CTGGATAAGCACCAACAAA.GTAGCACATTGGGAAATTCCGTGGTTCGCTGTGATAAACTT 

HC2A GACCAGTCTGAGATTAAGAGCC TACTGATGTGT T TCC TCTACATC T TAAAGAGCATGTC T 

HC2-80 GACCAGTCTGAGATTAAGAGCC TACTGATGTGTTTCCTCTACATC T TAAAGAGCATGTCT 

HC2B GACCAGTCTGAGATTAAGAGCCTACTGATCTGTTTCCTCTACATCT TAAAGAGCATGTCT 

HC2C 

HC2D-KIAA1 058 GACCAGTC TGAGATTAAGAGCCTACTGATGTGTT TCC TCTACATC T TAAAGAGCATGTCT 

HC2E GACCAGTC TGAGATTAAGAGC C TACTGATGTGT TTCC T CTAGATC TTAAAGAGCATGTCT 

HC2F GACCAGTC TGAGATTAAGAGCCTAC TGATGTGT TTCC TCTACATC T TAAAGAGCAT GTC T 

HC2A GATGATGCTTTGTTTACATATTGGAACAAGGCTTCAACATCTGAACTTATGGATTTTTTT 

HC2-80 GATGATGCTTTGTTTACATATTGGAACAAGGCTTCAACATCTGAACTTATGGATTTTTTT 

HC2B GATGATGCTTTGTTTAGATATTGGAACAAGGCTTCAACATCTGAACTTATGGATTTTTTT 

HC2C 

HC2D-KIAA1058 GATGATGCTTTGTTTACATATTGGAACAAGGCTTCAACATCTGAACTTATGGATTTTTTT 

HC2E GATGATGCTTTGTTTACATATTGGAACAAGGCTTCAACATCTGAACTTATGGATTTTTTT 

HC2F GATGATGCTTTGTTTACATATTGGAACAAGGCTTCAA.CATCTGAACTTATGGATTTTTTT 

HC2A ACAATAT C TGAAGT CTGCC TGCACCAG TTCCAGTAGATGGGGAAGCGATACATAGCCAGG 

HC2 - 8 0 ACAATATC TGAAG T C T GC C TGCAC CAG T T C CAGTACAT GGGGAAGCGATACAT AGCCAGG 

H C2B ACAATAT C T GAAG TC TGC C T GC AC CAG T TC CAGTACAT G GGGAAGCGAT ACAT AG C CAGG 

HC2C 

HC2D-KIAA1058 ACAATATC TGAAGTCTGCCTGCACCAGTTCCAGaACATGGGGAAGCGATACATAGCCAG- 

hc2 e ACAATATC TGAAG T C T GC C TGCAC CAGT T C CAGTACAT GGGGAAGCGAT ACATAGC CAGG 

HC2F ACAATATC T GAAGT C TGC C TGCAC CAGT T C CAGTACAT GGGG AAGC GAT ACAT AGC CAG - 
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AACGAGGAGGGGTTGCGACCCATAGTTCATGATCGAAAGTCTCAGACATTGCCTGTTTCC 
AACCAGGAGGGGTTGGGACCCATAGTTCATGATCGAAAGTCTCAGACATTGCCTGTTTCC 
AACCAGGAGGGGTafGGGACCCATAGTTCATGATCGAAAGTCTCAGACATTGCCTGTTTCC 


AA 

AACCAGGAGGGGTTGGGACCCATAGTTCATGATCGAAAGTCTCAGACATTGCCTGTTTCC 


CGT AACAGAACAGGAATGATGCAT GCC AGAT TGCAGCAGC TGGGCAGC C TGGATAACTCT 
CGTAACAGAACAGGAAIGATGCATGCCAGAT TGCAGCAGC TGGGCAGC C TGGATAACTC T 


CAGGAATGAxGCATGCCAGAxTGCAGCAGC xGGGCAGCC rGGAx AACxC T 

C GTAACAGAACAGGAAIGATGCATGCC AGATT GCAGCAGC TGGGCAGCC TGGATAACTCT 
'. GCTTGGAA 


C TCACT T TTAACCACAGC TATGGCCAC TCGGACGCAGATG T T CTGCACCAG TCATTAC T T 
C TC AC T T TTAACCACAGC TATGGCCAC TCGGACGCAGATGT TC TGCACCAGTCAT TACT T 
C T CACTT TTAACCACAGC TATGGCCAC TCGGACGCAGATGT T C TGCACCAG T CATTACT T 


C TC ACT T T TAACC ACAGC TATGGCCAC TCGGACGCAGATG TTCTGCACCAGTCATTACTT 
C TCACT T T TAACC ACAGC TATGGCCAC TCGGACGCAGATGT TCTGCACCAGTCATTACTT 
- T X TCT GTAGACAATGGC TATGGCCACTCGGACGCAGAT GT T CTGCACCAG TCATTACT T 


GAAGCCAACATTGCTACTGAGGTTTGCCTGACAGCTCTGGACACGCTTTCTCTATTTACA 
GAAGCCAACAxTGCT7VCTGAGGTTTGCCTGACAGCTCTGGA<^C^ 

GAAGCCAACATTGCTAC TGAGGT TTGCCTGACAGC TC TGGACACGC T TTC TC TATT TACA 


GAAGCCAACAT TGCTAC TGAGGT TTGCCT GACAGC TC TGGACACGC T TTCT CTATT TACA 
GAAGCCAACAT TGCTACTGAGGT TTGCCTGACAGC TC TGGACACGC T TTC T C TATT TACA 
GAAGCC^C^T TGCTAC TGAGGTTTGCCTGACAGCTCTGGACACGCTTTCTCTATTTACA 


T TGGCGT TTAAGAACCAGCTCC TGGCC GACCATGGACATAATCC TC TCATGAAAAAAGTT 
T TG GCGT TTAAGAACCAGCTCC TCGCCGACCATGGACATAATCC TC TCATGAAAAAAGTT 
TTGGCGTTTAAG CTCCTGGCCGACCATGGACATAATCCTC TCATGAAAAAAGTT 


T TGGCGTTTAAGAACCAGCTCC TGGCCGACCATGGACATAATCC TC TCATGAAAAAAGTT 
TTGGCGTTTAAGAACCAGCTCCTGGCCGACCATGGACATAATCCTCTCATGAAAAAACTT 
TTGGCGTTTAAGAACC?VGCTCCTGGCCGACCATGGACATAATCCTCTCATGAAAAAAAAA 


TTTGATGTCTACCTGTGTTTTCTTCAAAAACATCAGTCTGAAACGGCTTTAAAAAATGTC 
TTTGATGTCTACCTGTGTTT TCTTCAAAAACATCAGTCTGAAACGGCT TTAAAAAATGTC 
TT TGATGT CTACCTGTGTT T TC T TCAAAAACATCAGTCT GAAACGGC T T TAAAAAATGTC 


TTTGATGTCTACCTGTGTTTTCTTCAAAAACATCAGTCTGAAACGGCT TTAAAAAATGTC 
XT TGATGTCTACCTGTGTTTTCTTCAAAAACATCAGTCT GAAACGGC TT TAAAAAATGTC 
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TTOACTGCCTTAAGGTCCTTAATTTATAAGTTTCCCTCAAC^^ 

TTCACTGCCTTAAGGTCCTTAATTTATAAGTTTCCCTCAACATTCTATGAAGGGAGAGCG 
TTCACTGCCTTAAGGTCCTTAATTTATAAGTT TCCCTCAACAT TC T ATGAAGGGAGAGCG 


TTCACTGCCTTAAGGTCCTXAATTTATAAGTTTCCCTCAACATTCTATGAAGGGAGAGCG 
TTGACTGCCTTAAGGTCCTTAATTTATAAGTTTCCCTCAACAOT 


GACATGTGTGCGGCTCTGTGTTACGAGATTCTCAAGTGCTGTAACTCCAAGCTGAGCTCC 
GACATGTGTGCGGCTCTGTGT TACGAGATTCT CAAGTGCTGTAACT CCAAGCTGAGCTCC 
GACATGTGTGCGGCTCTGTGTTACGAGATTCTCAAGTGCTGTAACTCCAAGCTGAGCTCC 


GACATGTGTGCGGCTCTGTGTTACGAGATTCTCAAGTGCTGTAACTCCAAGCTGAGCTCC 
GACATGTGTGCGGCTCTGTGT TACGAGATTCT CAAGTGCTGTAACT CCAAGCTGAGCTCC 


ATCAGGACGGAGGCCTCCCAGCTGCTCTACTTCCTGATGAGGAACAACTTTGATTACACT 
ATCAGGACGGAGGCCTCCCAGCTGCTCTACTTCCTGATGAGGAACAACTTTGATTACACT 
ATCAGGACGGAGGCCTCCCAGC TGCTC TACTTCCTGATGAGGAACAAC TTTGATTACACT 


ATCAGGACGGAGGCCTCCCAGCTGCTCTACTTCCTGATGAGGAACAACTTTGATTACACT 
ATCAGGACGGAGGCCTCCCAGCTGCTCTACTTCCTGATGAGGAAGAACTTTGATTACACT 


GGAAAGAAGTCCTT T GTCCGGACACAT TTGCAAGT CATCATATC TGTCAGCCAGC TGATA 
GGAAAGAAGTCCTTTGTCCGGACACATTTGCAAGTCATCATATCTGTCAGCCAGCTGATA 

GGAAAGAAGTCCTTTGTCCGGACACATTTGCAAGTCATCATA 


GGAAAGAAGTCCTTTGTCCGGACACATTTGCAAGTCATCATATCTGTCAGCCAGCTGATA 
GGAAAGAAGTCCTT TGTCCGGACACAT TTGCAAGTCATCATAT C TGTCAGCCAGCTGATA 


GCAGAC GTTGTTGG CAT T GGGGAAAC CAGAT T CCAGC AGT C C C TGT C CATCATCAACAAC 
GCAGACGTTGT TGGCAT TGGGGAAACCAGAT TCCAGCAGT CCC TGTCGA!^ 
GCAGACGT TGTTGGCATTGGGGAAACCAGAT TCCAGCAGTCC CTGT C CATCATCAACAAC 


GCAGACGTTGT TGGCAT TGGGGGAACCAGATTCCAGCAGTCCC TGT C CATCATCAACAAC 
GCAGACGTTGTTGGCATTGGGGAAACCAGATTCCAGCAGTCCCTGTCC^ 


t gt gc caacagt g accgg c t tattaag cacaccagc ttctcctc tg atgt g aag gac t ta 
tgtgccaacagtgacxiggcttattaagcacacgagct^ 

tgtgcc^uicagtg^cx:ggcttattaagcac^cagcttctcctctgatgtgaaggactta 


tgtgccaacagtgaccggct tattaagcacaccagcttc tcc tctgatgtgaaggactta 
tgtgccjuu^agtgaccggcti^ttaagc^caccagcttctcctctgatgtgaaggact^ 


HC2F 
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HC2A ACCAAAAGGATACGCACGGTGCTAATGGCC^CCGCCCAGATGAAGGAGCATGAGAACGAC 

HC2 - 8 0 ACCAAAAGGATACGCACGGTGCTAATGGCCACCGCCCAGATGAAGGAGCATGAGAACGAC 

HC2B AC GAAAAG GATACGCACGGTGCTAATGGCCAC CGCCC AGAT GAAGGAGCATGAGAACGAC 

HC2C 

HC2D-KIAA1058 AC CAAAAGGATACGCACGGTGCTAATGGCCAC CGC CCAGAT GAAGGAGCATGAGAACGAC 

HC2E AC CAAAAGGATACGCACGGTGC TAAT GGCCAC CGCCC AGAT GAAGGAGCATGAGAACGAC 

HC2F 


HC2A CCAGAGATGCTGGTGGACCTCCAGTACAGCCTGGCCAAATCCTATGCCAGCACGCCCGAG 

HC2 - 8 0 CCAGAGATGCTGGTGGACCTCCAGTACAGCCTGGCCAAATCC TATGCCAGCACGCCCGAG 

HC2 B CGAGAGATGCTGGTGGACCTCCAGTACAGCCTGGCCAAATCC TATGCCAGCACGCCCGAG 

HC2C 

KG 2D - KZAA1 058 C CACAGAT GCIGCHG GAG C T CGAG TACAG C C T GG C GAAAT CC TATG G CAGGAG GG G G GAG 

HC2E CCAGAGATGCTGGTGGAC CTCCAGTACAGCC TGGC CAAATCC TATGCCAGCACGCCCGAG 

HC2F 


HC2A CTCAGGAAGACGTGGCTCGACAGCATGGCC^GGATCCATGTCAAAAATGGCGATCTCTCA 

HC2 - 8 0 CTCAGGAAGACGTGGCTCGACAGCATGGCCAGGATCCATGTCAAAAATGGCGATCTCTCA 

HC2B CTCAGGAAGACGTGGCTCGACAGCATGGCCAGGATCC ATG T C AAAAAT GGC GATCT C TCA 

HC2C " 

HC2D-KIAA1058 CTCAGGAAGACGTGGCTCGACAGCATGGCCAGGATCCATGTCAAAAATGGCGATCTCTCA 

HC2E C TCAGGAAGACGTGGCTCGACAGCATGGCCAGGATCCATGTCAAAAATGGCGATCTCTC^ 
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HC2A GAGGCAGCAATGTGCTATGTCCACGTAACAGCCCTAGTGGCAGAATATCTCACACGGAAA 
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HC2C 

HC2D-KIAA1058 GAGGCAGCAATGTGCTATGTCCACGTAACAGCCCTAGTGGC^GAATATCTCACACGGAAA 

HC2E GAGGCAGCAATGTGC TATGTCCACGTAACAGC CC TAGTGGCAGAATATC TCACACGGAAA 
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HC2D-KIAA1 058 GAAGCAGTCCAGTGGGAGCCGCCCCTTCTCCCCCACAGCCATAGCGCCTGCCTGAGGAGG 

HC2E G 

HC2F 

HC2A GCGTG T T TAGACAAGGAT GCAC CGCC T TCAGGGT CAT TACC CCAAACATC 

HC2 - 80 GCGTGTTTAGACAAGGATGCACCGCCTTCAGGGTCATTACCCCAAACATC 

HC2B GCGTGTTTAGACAAGGATGCAC CGCC TTCAGGGTCAT TACC CCAAACATC 

HC2C G T TTAGACAAGGAT GCAC C GC C T TCAGGGT CATTACCC CAAACATC 

HC2D-KIAA1058 AGCCGGGGAGGCGTGTTTAGACAAGGATGCACCGCCTTCAGGGTCATTACCCCAAACATC 

jjC2E GCGTGTTTAGACAAGGATGCACCGCCTTCAGGGTCATTACCCCAAACATC 
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HC2-80 GACGAGGAGGCCTCCATGATGGAAGACGTGGGGATGCAGGATGTCCATTTCAACGAGGAT 

HC2B GAC GAGGAGGC C T CCAT GATGGAAGAC GT GGG GATGCAGGATG TC CAT T TC AAC GAGGAT 

HC2C GAC GAGGAGGC C TCCAT GAT G GAAGAC G T GGG GATGCAGGATG TC CAT T T C AAC GAGGAT 

HC2D -KIAA1 056 GACGAGGAGGCCTCCATGATGGAAGACGTGGGGATGCAGGATGTCCATTTCAACGAGGAT 

HC2E GACGAGGAGGCC TCCATGATGGAAGAC GTGGG GA 
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HC2A GTGCTGATGGAGC TCC T TGAGC AGTGCGCAGATGGAC TCTGGAAAGCCGAGCGC TACGAG 

HC2 - 8 0 GTGCTGATGGAGCTCCTTGAGCAGTGCGCAGATGGAC TCTGGAAAGCCGAGCGC TACGAG 

HC2B GTGCTGATGGAGC TCCTTGAGCAGTGCGCAGATGGAC TCTGGAAAGCCGAGCGC TACGAG 

HC2C GTGCTGATGGAGCTCCTTGAGCAGTGCGCAGATGGACTCTGGAAAGCCGAGCGCTACGAG 

HC2D - KXAA1 058 GTGCTGATGGAGCTCCTTGAGCAGTGCGCAGATGGAC TCTGGAAAGCCGAGCGC TACGAG 

HC2E AAGCC GAG CGC TACGAG 
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HC2F 
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HC2E CACTCGGGCCGCAGGCTTCTGGGGACCTACTTCCGGGTAGCCTTCTTCGGGCAGG 

HC2F 

HC2A : C T T T GAAGAT GAAGATGGA 

HC2-80 CTTTGAAGATGAAGATGGA 

HC2B GAT TCTTTGAAGAT GAAGATGGA 

HC2C GAT TCTTTGAAGAT GAAGATGGA 

HC2D-KIAA1 058 CAATAC CAG T T TACAGACAGTGAAACAGATG T GGAGGGAT TC T T TGAAGAT GAAGAT GGA 

HC2E GATTCTTTGAAGAT GAAGATGGA 

HC2F 
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AAGGAGTATATTTACAAGGAACCCAAACTCACACCGCTGTCGGAAATraCTCAGAGACTC 
AAG GAGTATAT T T ACAAGGAAC CCAAAC T CACAC CGC T G T C G GAAAT TT C T CAGAGAC TC 
AAGGAGTATAT T TACAAGGAAC CCAAAC TCAC AC CGC T GTCG GAAAT TTC TCAGAGACTC 
AAGGAGTATAT TTACAAGGAACCCAAAC TCACAC CGC T GTCGGAAAT TTC TCAGAGAC TC 
AAGGAGTATATTTACAAGGAAC CCAAAC TCAC AC CGC T GTCGGAAAT TTCT CAGAGAC TC 


CTT AAAC TGTAC TCGGATAAAT TTGGT TCTGAAAATG TCAAAATGATACAGGATTCTGGC 
CT T AAACTGTAC TCGGATAAAT TTGGT TC TGAAAATG TCAAAAT GATACAG GATTC TGGC 
CTT AAAC T GTAC T CGGAT AAAT T T GGT T C TGAAAAT G TCAAAATGAT ACAG GAT T C T GGC 
7TTAAACTGTACTCGGATAAATTTGGT TC TGAAAAT GTCAAAATGACACAG GATTC TGGC 
CTT AAACT G TAG T CG GATAAAT-T-TGG T-T CTGAAAAT G T CL*JVAAT GAT AGAG GAT T C T GGC 
CTT AAACT GTACTCGGATAAAT T TGGT TCTGAAAATGTCAAAATGATACAGGATTC TGGC 


AAGGTCAACCC TAAGGATCTGGATTCTAAGTATGCATACATCCAGGTGAC TCACGT CATC 
AAGGTCAACCCTAAGGATCTGGATTCTAAGTATGCATACATCCAGGTGACTCACGTCATC 
AAG GT CAACCC TAAGGATCTGGATTC TAAGTATGCATACAT CCAGGTGAC TCACGT CATC 
AAGGTCAACCC TAAGGATC TGGAT TC T AAGTATGCAT ACAT CCAGGTGAC T CACGTCATC 
AAG G TCAACCC TAAGGATC TGGAT TC TAAGTATGCC T ACAT CCAGGTGAC T CACGT CATC 
AAG GTCAACCC TAAGGATC TGGAT TC TAAGTATGCAT ACAT CCAGG TGAC T CACGTCATC 


CCC TTCTTTGACGAAAAAGAGT TGCAAGAAAG GAAAACAGAG T T T GAGAGA T C CCACAAC 
CCC T TC TT TGACGAAAAAGAGT TG CAAGAAAGGAAAACAGAGT TTGAGAGATCCCACAAC 
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GAAGAGCAGTGCAAACGGCGCACCATCCTGACAGCCATAGACTGCTTC^ 
GAAGAGCAGTGCAAACGGCGCACCATCC TGACAGCCATACAC TGC TTCCCT TATGTGAAG 

GAAGAGCAGTGCAAACGGCGCACCATCCTGACAGCCATACACTGCT TCCC T TATGTGAAG 
GAAGAGCAGTGCAAACGGCGCACCATCCTGACAGCCATACACTGC TTCCCT TATGTGAAG 
GAAGAGCAG TGCAAAC GG C GCACCAT C C T GAC AGCCAT ACAC TGCTTCCCT TATG T GAAG 
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AAGCGCATCCCTGTCATGTACCAGCACCACACTGACCTGAACCCCATCGAGGTGGCCATT 
AAGCGCATCCCTCTCATGTACCAGCACCACACTGACCTGAACCCCATCGAGGTGGCCATT 
AAGCGCATCCCTGTCATGTACCAGCACCACACTGACCTGAACCCCAXCGAGGTGGCCATT 
AAGCGCATCCCTTTCATGTACCAGCACCACACTGACCTGAACCCCATCGAGGT — CCATT 
AAGCGCATCCCTGTGATGTACCAGCACCACACTGACCTGAACCCCATCGAGGTGGCCATT 
AAGCGCATCCCTGTCATGTACCAGCACCACACTGACCTGAACCCCATCGAGGTGGCCATT 


GACGAGATGAGTAAGAAGGTGGCGGAGCTCCGGCAGCTGTGCTCCTCGGCCGAGGTGGAC 
GAC GAGAT GAGTAAGAAGGT GGCGGAGCTCC GGCAGC TGTGC TCC TCGGCCGAGGT GGAC 
GACGAGATGAGTAAGAAGGTGGCGGAGCTCCGGCAGCTGTGCTCCTCGGCCGAGGTGGAC 
GACGAGATGAGTAAGAAGGTGGCGGAGCTCCGGCAGCTGTGCTCCTCGGCCGAGGTGGAC 
GAC GAGAT GAGTAAGAAGGTGGCGGAGC TCC GGCAGC TGTGC TCC TCGGCCGAGGT GGAC 
GACGAGATGAGTAAGAAGGTGGCGGAGCTCCGGCAGCTGTGCTCCICGGCCGAGGTGGAC 


ATGATCAAACTGCAGCTCAAACTCCAGGGCAGCGTGAGTGTTCkGGTC^ 
ATGATCAAACTGGAGCTCAAACTCCAGGGCAGCGTGAGTGTTCAGGTCAATGCrGGCCCA 
ATGATCAAACTGCAGCTCAAACTCCAGGGCAGCGTGAGTGTTCAGGTCAATGCTGGCCCA 
ATGATCAAACTGCAGK^CAAACTCCZAGGGCAGCGT GAGTGTTCAGGTCAAT GCTGGCCCA 
ATGATCAAACTGCAGCTCAAACTCCAGGGCAGCGTGAGTGTTCAGGTCAAtrGCTGGCCCA 
ATGATCAAAC TG CAGCTCAAAC T CCAGGGCAGCGTGAGT GTTCAGGT CAATGCTGGCCCA 


CTAGCATATGCGCGAGCTTTCTTAGATGATACAAACACAAAGCGATATCCTGACAATAAA 
C T AGCAT AT GCGC GAGC TTTCT TAGATGAT ACAAACACAAAG C GAT AT CC T GACAAT AAA 
C TAGCAT ATGCGC GAGC TTTCT T AGAT GAT ACAAACACAAAG C GAT AT CC T GACAATAAA 
C TAGCATATGCGC GAGC TTTCT T AGAT GAT ACAAACACAAAG CGAT AT CC T GACAATAAA 
C T AGCAT ATGCGC GAGC TT T CT TAGATGATACAAACACAAAGCGAT ATCC T GACAATAAA 
C TAGCAT ATGC GC GAGC TTTCT T AGAT GAT ACAAACACAAAG CGAT AT C C T GACAATAAA 


GTGAAGCTGCTTAAGGAAGTTTTCAGGCAATTTGTGGAAGCTTGCGGTCAAGCCTTAGCG 
GTGAAGCTGCT TAAGGAAGTT T TCAGGCAATT TGTGGAAGC TT GC GGTCAAGCCT TAGCG 
GTGAAGC TGCT TAAGGAAGTT T TCAGGCAATT TGTGGAAGCTTGC GGTCAAGCC T TAGCG 
GTGAAGCTGCT TAAGGAAGTTTTCAGGCAATT TGTGGAAGC TTGCGGTGAAGCCT TAGCG 
GTGAAGCTGCTTAAGGAAGTTTTCAGGCAATTTGTGGAAGCTTGCGGTCAAGCCTTAGCG 
GTGAAGC TGCT TAAGGAAGTT T TCAGGCAATT TGTGGAAGCTTGCGGTCAAGCCTTAGCG 
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GT AAAC GAAC GT C T GAT T AAAGAAGAC CAGC T C GAGT AT CAG GAAG AAAT G AAAGC CAAC 
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GTAAACGAACGTCTGATTAAAGAAGACCAGCTCGAGTATCAGGAAGAAATGAAA 
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TAAAGAACAACGT TATTTCTTAACAGACTT TCTATAGGAGTTGTAAGAAGGTGCACATAT 


TAAAGAACAACGT TATTT CTT AACAGAC T T TC TATAGGAGT T GTAAGAAGGTGCACATAT 
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1 
A 

2 32 

GTT TTA CAC CAT CAC CAA AAC CCA GAA TTT TAT GAT GAG ATT AAA ATA GAG TTG CCC ACT 
val leu his his his gin asn pro glu phe tyr asp glu ile lys ile glu leu pro thr 

62 92 

CAG CTG CAT GAA AAG CAC CAC CTG TTG CTC ACA TTC TTC CAT GTC AGC TGT GAC AAC TCA 
gin leu his glu lys his his leu leu leu thr phe phe his val ser cys asp asn ser 

122 152 

AGT AAA GGA AGC ACG AAG AAG AGG GAT GTC GTT GAA ACC CAA GTT GGC TAC TCC TGG CTT 
ser lys gly ser thr lys lys arg asp val val glu thr gin val gly tyr sar trp leu 

182 212 

CCC CTC CTG AAA GAC GGA AGG GTG GTG ACA AGC GAG CAG CAC ATC CCG GTC TCG GCG AAC 
pro leu leu lys asp gly arg val val thr ser glu gin his ile pro val ser ala asn 

242 272 

CTT CCT TCG GGC TAT CTT GGC TAC CAA GAG CTT GGG ATG GGC AGG CAT TAT GGT CCG GAA 
leu pro ser gly tyr leu gly tyr gin glu leu gly met gly arg his tyr gly pro glu 

302 332 

ATT AAA TGG GTA GAT GGA GGC AAG CCA CTG CTG AAA ATT TCC ACT CAT CTG GTT TCT ACA 
ile lys trp val asp gly gly lys pro leu leu lys ile ser thr his leu val ser thr 

■ re£ 1*1/ 1.2 and 1.3 
362 I 392 

GTG TAT ACT CA<* GAT CAG CAT TTA CAT AAT TTT TTC CAG TAC TGT CAG AAA ACC GAA TCT 
val tyr thr gin asp gin his leu his asn phe phe gin tyr cys gin lys thr glu ser 

422 452 

GGA GCC CAA GCC TTA GGA AAC GAA CTT GTA AAG TAC CTT AAG AGT CTG CAT GCG ATG GAA 
gly ala gin ala leu gly asn glu leu val lys tyr leu lys ser leu his ala met glu 

482 512 

GGC CAC GTG ATG ATC GCC TTC TTG CCC ACT ATC CTA AAC CAG CTG TTC CGA GTC CTC ACC 
gly his val met ile ala phe leu pro thr ile leu asn gin leu phe arg val leu thr 

542 572 

AGA GCC ACA CAG GAA GAA GTC GCG GTT AAC GTG ACT CGG GTC ATT ATT CAT GTG GTT GCC 
arg ala thr gin glu glu val ala val asn val thr arg val ile ile his val val ala 

602 632 

CAG TGC CAT GAG GAA GGA TTG GAG AGC CAC TTG AGG TCA TAT GTT AAG TAC GCG TAT AAG 
gin cys his glu glu gly leu glu ser his leu arg ser tyr val lys tyr ala tyr lys 

662 692 

GCT GAG CCA TAT GTT GCC TCT GAA TAC AAG ACA GTG CAT GAA GAA CTG ACC AAA TCC ATG 
ala glu pro tyr val ala ser glu tyr lys thr val his glu glu leu thr lys ser met 
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722 752 

ACC ACG ATT CTC AAG CCT TCT GCC GAT TTC CTC ACC AGC AAC AAA CTA CTG AGG TAC TCA 
thr thr ile leu lys pro ser ala asp phe leu thr ser asn lys leu leu arg tyr ser 

782 812 

TGG TTT TTC TTT GAT GTA CTG ATC AAA TCT ATG GCT CAG CAT TTG ATA GAG AAC TCC AAA 
trp phe phe phe asp val leu ile lys ser met ala gin his leu ile glu asn ser lys 

842 ICadherin Cleavage] 872 

GTT AAG TTG CTG CGA AAC CAG AGA TTT CCT GCA TCC TAT CAT CAT GCA GCG GAA ACC GTT 
val lys leu leu arg asn gin arg phe pro ala ser tyr his his ala ala glu thr val 

902 932 

GTA AAT ATG CTG ATG CCA CAC ATC ACT CAG AAG TTT GGA GAT AAT CCA GAG GCA TCT AAG 
val asn met leu met pro his ile thr gin lys phe gly asp asn pro glu ala ser lys 

962 992 

AAC GCG AAT CAT AGC CTT GCT GTC TTC ATC AAG AGA TGT TTC ACC TTC ATG GAC AGG GGC 
asn ala asn his ser leu ala val phe ile lys arg ays phe thr phe met asp arg gly 

ref 2 . M 

1022 1052 ^ 

TTT GTC TTC AAG CAG ATC AAC AAC TAC ATT AGC TGT TTT GCT CCT GGA GAC CCA AAG ACC 

phe val phe lys gin ile asn asn tyr ile ser cys phe ala pro gly asp pro lys thr 

1082 1112 

CTC TTT GAA TAC AAG TTT GAA TTT CTC CGT GTA GTG TGC AAC CAT GAA CAT TAT ATT CCG 
leu phe glu tyr lys phe glu phe leu arg val val cys asn his glu his tyr ile pro 

1142 1172 

TTG AAC TTA CCA ATG CCA TTT GGA AAA GGC AGG ATT CAA AGA TAC CAA GAC CTC CAG CTT 
leu asn leu pro met pro phe gly lys gly arg ile gin arg tyr gin asp leu gin leu 

1202 1232 

GAC TAC TCA TTA ACA GAT GAG TTC TGC AGA AAC CAC TTC TTG GTG GGA CTG TTA CTG AGG 
asp tyr ser leu thr asp glu phe cys arg asn his phe leu val gly leu leu leu arg 

1262 1292 

GAG GTG GGG ACA GCC CTC CAG GAG TTC CGG GAG GTC CGT CTG ATC GCC ATC AGT GTG CTC 
glu val gly thr ala leu gin glu phe arg glu val arg leu ile ala ile ser val leu 

■ ref 3.1 

1322 1352 1 

AAG AAC CTG CTG ATA AAG CAT TCT TTT GAT GAC AGA TAT GCT TCA a3g AGC CAT CAG GCA 
lys asn leu leu ile lys his ser phe asp asp arg tyr ala ser arg ser his gin ala 

1382 1412/471 

AGG ATA GCC ACC CTC TAC CTG CCT CTG TTT GGT CTG CTG ATT GAA AAC GTC CAG CGG ATC 
arg ile ala thr leu tyr leu pro leu phe gly leu leu ile glu asn val gin arg xle 

1442 1472 

AAT GTG AGG GAT GTG TCA CCC TTC CCT GTG AAC GCG GGC ATG ACC GTG AAG GAT GAA TCC 
asn val arg asp val ser pro phe pro val asn ala gly met thr val lys asp glu ser 
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1502 1532 

CTG GCT CTA CCA GCT GTG AAT CCG CTG GTG ACG CCG GAG AAG GGA AGC ACC CTG GAC AAC 
leu ala leu pro ala val asn pro leu val thr pro gin lys gly ser thr leu asp asn 

| ref 4 . 1 and 4 . 2 
1562 1592 1 

AGC CTG CAC AAG GAC CTG CTG GGC GCC ATC TCC GGC ATT GCT TCT CCA TAT ACA ACC TCA 
ser leu his lys asp leu leu gly ala ile ser gly ile ala ser pro tyr thr thr ser 


1622 1652 

ACT CCA AAC ATC AAC AGT GTG AGA AAT GCT GAT TCG AGA GGA TCT CTC ATA AGC ACA GAT 
thr pro asn ile asn ser val ar-g asn ala asp ser arg gly ser leu ile ser thr asp 

ref 5 . 1 and 5 . 2 

1682 1712 J, 

TCG GGT AAC AGC CTT CCA GAA AGG AAT AGT GAG AAG AGC AAT TCC CTG GAT AAG CAC CAA 

ser gly asn ser leu pro glu arg asn ser glu lys ser asn ser leu asp lys his gin 

1742 1772 

CAA AGT AGC ACA TTG GGA AAT TCC GTG GTT CGC TGT GAT AAA CTT GAC CAG TCT GAG ATT 
gin ser ser thr leu gly asn ser val val arg cys asp lys leu asp gin ser glu ile 

1802 1832 

AAG AGC CTA CTG ATG TGT TTC CTC TAC ATC TTA AAG AGC ATG TCT GAT GAT GCT TTG TTT 
lys ser leu leu met cys phe leu tyr ile leu lys ser met ser asp asp ala leu phe 

1862 1892 

ACA TAT TGG AAC AAG GCT TCA ACA TCT GAA CTT ATG GAT TTT TTT ACA ATA TCT GAA GTC 
thr tyr trp asn lys ala ser thr ser glu leu met asp phe phe thr ile ser glu val 

■ ref 6.1 

1922 I 1952 

TGC CTG CAC CAG TTC CAG TAC ATG GGG AAG CGA TAC ATA GCC AGG AAC CAG GAG GGG TTG 
cys leu his gin phe gin tyr met gly lys arg tyr ile ala arg asn gin glu gly leu 


1982 2012 

GGA CCC ATA GTT CAT GAT CGA AAG TCT CAG ACA TTG CCT GTT TCC CGT AAC AGA ACA GGA 
gly pro ile val his asp arg lys ser gin thr leu pro val ser arg asn ax-g thr gly 

2042 2072 

ATG ATG CAT GCC AGA TTG CAG CAG CTG GGC AGC CTG GAT AAC TCT CTC ACT TTT AAC CAC 
met met his ala arg leu gin gin leu gly ser leu asp asn ser leu thr phe asn his 

2102 2132 

AGC TAT GGC CAC TCG GAC GCA GAT GTT CTG CAC CAG TCA TTA CTT GAA GCC AAC ATT GCT 
ser tyr gly his ser asp ala asp val leu his gin ser leu leu glu ala asn ile ala 


ref 7.1 | 

2162 2192 1 

ACT GAG GTT TGC CTG ACA GCT CTG GAC ACG CTT TCT CTA TTT ACA TTG GCG TTT AAG AAC 
thr glu val cys leu thr ala leu asp thr leu ser leu phe thr leu ala phe lys asn 

2222 2252 

CAG CTC CTG GCC GAC CAT GGA CAT AAT CCT CTC ATG AAA AAA GTT TTT GAT GTC TAC CTG 
gin leu leu ala asp his gly his asn pro leu met lys lys val phe asp val tyr leu 

2282 2312 
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TGT TTT CTT CAA AAA CAT CAG TCT GAA ACG GCT TTA AAA AAT GTC TTC ACT GCC TTA AGG 
cys phe leu gin lys his gin aer gin thr ala leu lys acn val phe thr ala leu arg 

2342 2372 

TCC TTA ATT TAT AAG TTT CCC TCA ACA TTC TAT GAA GGG AGA GCG GAC ATG TGT GCG GCT 
ser leu ile tyr lys phe pro ser thr phe tyr glu gly arg ala asp met cys ala ala 

2402 2432 

CTG TGT TAC GAG ATT CTC AAG TGC TGT AAC TCC AAG CTG AGC TCC ATC AGG ACG GAG GCC 
leu cys tyx glu ile leu lys cys cys asn ser lys leu ser ser ile arg thr glu ala 

2462 2492 

TCC CAG CTG CTC TAC TTC CTG ATG AGG AAC AAC TTT GAT TAC ACT GGA AAG AAG TCC TTT 

ser gin leu leu tyr phe leu met arg asn asn phe asp tyr thr gly lys lys ser phe 
2522 2552 

GTC CGG ACA CAT TTG CAA GTC ATC ATA TCT GTC AGC CAG CTG ATA GCA GAC GTT GTT GGC 
val arg thr his leu gin val ile ile ser val ser gin leu ile ala asp val val gly 

2582 2612 

ATT GGG GAA ACC AGA TTC CAG CAG TCC CTG TCC ATC ATC AAC AAC TGT GCC AAC AGT GAC 
ile gly glu thr arg phe gin gin ser leu ser ile ile asn asn cys ala asn ser asp 

2642 2672 

CGG CTT ATT AAG CAC ACC AGC TTC TCC TCT GAT GTG AAG GAC TTA ACC AAA AGG ATA CGC 
arg leu ile lys his thr ser phe ser ser asp val lys asp leu thr lys arg ile arg 

2702 2732 

ACG GTG CTA ATG GCC ACC GCC CAG ATG AAG GAG CAT GAG AAC GAC CCA GAG ATG CTG GTG 
thr val leu met ala thx ala gin met lys glu his glu asn asp pro glu met leu iral 

2762 2792 

GAC CTC CAG TAC AGC CTG GCC AAA TCC TAT GCC AGC ACG CCC GAG CTC AGG AAG ACG TGG 
asp leu gin tyr ser leu ala lys ser tyr ala ser thr pro glu leu arg lys thr trp 

2$22 2852 | raoOTonanouaoaa: Predicted 

CTC GAC AGC ATG GCC AGG ATC CAT GTC AAA AAT GGC GAT CTC TCA GAG GCA GCA ATG TGC 
leu asp ser met ala arg ile his val lys asn gly asp leu ser glu ala ala met cys 

Transmembrane Domain xxKxxxxKXXxxxxxxxxaaaaaaaac | 

TAT GTC CAC GTA ACA GCC CTA GTG GCA GAA TAT CTC ACA CGG AAA GGC GTG TTT AGA CAA 
tyr val his val thr ala leu val ala glu tyr leu thr arg lys gly val phe arg gin 

2942 2972 

GGA TGC ACC GCC TTC AGS GTC ATT ACC CCA AAC ATC GAC GAG GAG GCC TCC ATG ATG GAA 
gly cys thr ala phe arg val ile thr pro asn ile asp glu glu ala ser met met glu 
ref 8.1 | 

3002 J. 3032 

GAC GTG GGG ATG CAG GAT GTC CAT TTC AAC GAG GAT GTG CTG ATG GAG CTC CTT GAG CAG 
asp val gly met gin asp val his phe asn glu asp val leu met glu leu leu glu gin 

3062 3092 

TGC GCA GAT GGA CTC TGG AAA GCC GAG CGC TAC GAG CTC ATC GCC GAC ATC TAC AAA CTT 
ala asp gly leu trp lys ala glu arg tyx glu leu ile ala asp ile tyr lys leu 
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J ref 9 . 1 
3152 
AGG GAT TTC TTT GAA GAT GAA GAT GGA AAG GAG TAT 
ile il» pro ile tyr glu lye arg arg asp phe phe glu asp glu asp gly lys glu tyr 


31B2 3212 

ATT TAC AAG GAA CCC AAA CTC ACA CCG CTG TCG GAA ATT TCT CAG AGA CTC CTT AAA CTG 

ile tyr lym glu pro lys leu thr pro leu ser glu ile eer gin axg leu leu lys leu 

ref 10.11 

324S 3272 

TAC TCG GAT AAA TTT GOT TCT GAA AAT GTC AAA ATG ATA CAG GAT TCT GGC AAG' GTC AAC 

tyr «er asp lys pa© gly ser glu asn val lys met ile gin asp ser gly lys val asn 


3302 3332 

CCT AAO OAT CTO GAT TCT AAG TAT GCA TAC ATC CAG GTG ACT CAC GTC ATC CCC TTC TTT 
pro lys asp lou «nt> bar lys tyr ala tyr ile gin val thr his val ile pro phe phe 

3362 3392 

GAC GAA AAA GAG TTG CAA GAA AGG AAA ACA GAG TTT GAG AGA TCC CAC AAC ATC CGC CGC 
asp glu lys glu leu gin glu arg lys thr glu phe glu arg ser his asn ile arg axg 

3422 3452 

TTC ATC TTT GAG ATG CCA TTT ACG CAG ACC GGG AAG AGG CAG GGC GGG GTG GAA GAG CAG 
phe met ph* glu met pro phe thr gin thr gly lys arg gin gly gly val glu glu gin 

| re£ 11.1 

3482 J, 3512 

TGC AAA COO CGC ACC ATC CTG ACA GCC ATA CAC TGC TTC CCT TAT GTG AAG AAG CGC ATC 
cys lys arg arg thr ile leu thr ala ile his cys phe pro tyr val lys lys arg ile 


3542 3572 [ joooccxxx Coiled-coil 1 xaouatx 

CCT GTC ATG TAC CAG CAC CAC ACT GAC CTG AAC CCC ATC GAG GTG GCC ATT GAC GAG ATG 
pro val set tyr glu his his thr asp leu asn pro ile glu val ala ile asp glu met 


3602 xxxxxxxx Coiled coil 1 cont'd xxxx 3632 xxxxxjacxxxxxxxxxxxxxxxxxxxaoaooac: 
AGT AAG AAG GTG GCG GAG CTC CGG CAG CTG TGC TCC TCG GCC GAG GTG GAC ATG ATC AAA 
ser lys lyo val ala glu leu arg gin leu cys ser ser ala glu val asp met ile lys 

4 ref 12.1 
TC AAT GCT GGC CCA CTA GCA TAT 

leu gin leu lys leu gin gly ser val ser val gin val asn ala gly pro leu ala tyr 


3722 3752 

GCG CGA GCT TTC TTA GAT GAT ACA AAC ACA AAG CGA TAT CCT GAC AAT AAA GTG AAG CTG 
ala arg ala phe leu asp asp thr asn thr lys arg tyr pro asp asn lys val lys leu 

3782 3812 | xxxxxxx^ooDoauaauot 

CTT AAG GAA GTT TTC AGG CAA TTT GTG GAA GCT TGC GGT CAA GCC TTA GCG GTA AAC GAA 
leu lys glu val phe arg gin phe val glu ala cys gly gin ala leu ala val asn glu 

3642 jQouoaoc Coiled coil 2 xxmoo^xxx 3872 Kxiaaixxiaaaaocxxxxxxxx^^ 

CGT CTG ATT AAA GAA GAC CAG CTC GAG TAT CAG GAA GAA ATG AAA GCC AAC TAC AGG GAA 

arg leu ile lys glu asp gin leu glu tyr gin glu glu met lys ala asn tyr arg glu 

3902 xxxxxxxxxxxxxxxxxxxxxxxxxxacxacKxxx 3932 xxx\ 
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ATG GCG AAG GAG CTT TCT GAA ATC ATG CAT GAG CAG ATC TGC CCC CTG GAG GAG AAG ACG 
mat ala lys glu leu ser glu ile met his glu gin ile cys pro leu glu glu lys thr 

3962 3992 

AGC GTC TTA CCG AAT TCC CTT CAC ATC TTC AAC GCC ATC AGT GGG ACT CCA ACA AGC ACA 
sex val leu pro asn ser leu his ile ph© asn ala ile ser gly thr pro thr ser thr 

4022 I 3000 * PBM »3C2Cacx| 

ATG GTT CAC GGG ATG ACC AGC TCG TCT TCG GTC GTG TGA TTA CAT CTC ATG GCC CGT GTG 
met val his gly met thr ser ser ser ser val val STP 

4082 4112 

TGG GGA CTT GCT TTG TGA TTT GCA AAC TCA GGA TGC TTT CCA AAG CCA ATC ACT GGG GAG 
4142 4172 

ACC GAG CAC AGG GAG GAC CAA GGG GAA GGG GAG AGA AAG GAA ATA AAG AAC AAC GTT ATT 
4202 4232 

TCT TAA CAG ACT TTC TAT AGG AGT TGT AAG AAG GTG CAC ATA TTT TTT TAA ATC TCA CTG 
4262 4292 

GCA ATA TTC AAA GTT TTC ATT GTG TCT TAA CAA AGG TGT GGT AGA CAC TCT TGA GCT GGA 
4322 4352 

CTT AGA TTT TAT TCT TCC TTG CAG AGT AGT GTT AGA ATA GAT GGC CTA CAG AAA AAA AAG 
4382 4412 

GTT CTG GGA TCT ACA TGG CAG GGA GGG CTG CAC TGA CAT TGA TGC CTG GGG GAC CTT TTG 

Iref 13.1 
4472 
CCT CGA CTC GTG CCG GAA ATC TGA TCG TAA TCA GGG TAC AGA ACT TAC TAG TTT TGT CTA 

4502 4532 

GGA GTA TGT TGT ATG ACT AGG ATT TGT GCT ATT ATC TCA TTC AAC AAC ATA GAG CAA GAA 
4562 4592 

TAG TGA GCT AAC TGA GCT AGA CAC TCA ATT AAT CCG CTA CTG GCT TCA AGT CAG AAC TTT 

■ re£ 14.1 

4622 4652 I 

GTC ATT AAT CAT CGA CTC CGG GAC GGT CAT ATA TGT fl&T ACA TTT CTA CAT TTT TAA TAC 
4682 4712 

TCA CAT GGG CTT ATG CAT TAA GTT TAA TTG TGA TAA ATT TGT GCT GGT CCA GTA TAT GCA 
4742 4772 

ATA CAC TTT AAT GGT TTA TTC TTG TCA TAA AAA TGT GCA ATA TGG AGA TGT ATA CAA GTC 

4802 
TTT ACT 
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BAC sequences of Human CLASP 2 
Rcf 1.1 

Sequence of BAC4 using primer HC2AS2, which spans nucleotides 327-346 of the cDNA. Exon 
sequence is underlined and represents nucleotides 356-375. 

TTTCTACAGNGTNTACTCAG GTATGTGCTCCTTCAACAAAATTAGCAGTTGCTGCTCT 

GTGACAAAGTTTGCACCATTTTGCAAGAAGAAAAAAATCCTAATGTGTTATATTA 

TATTTTTACTCTATAGATCTTTTTCTAAAGAAAGAAAGTACAACTGAAGTGCTTATAT 

Gl ATTCATATAuAATGACTAGTACAAGCATCATTTTGCAACAGATTTCCCCTTTCATTG 

GAGGATCTTCTTGATGTTATTTGTACACGATCAATTTTTAGTCTTAATAAG 

TGGGTGTGGTGGCTCACACCTGTAATCCTAGCATTTTGGAGGCCAAGGTGGGCAGAT 

CACTTTAGCCCAGG^TTTGAGA^ 

CAAAAATACNAAAATTATCCAGGCATGGTGATGTGTGCCTGTA 

GAGGCTAGGGGTAGGGKJGATTTGCAAGAGGCTGGGAGGGTCAAAGCCCNAANTGAG 
CCATTGGTNCATGTCACTTGGACCCCAAGCNNGGGGNGANCAAGAGCAAAGGACTNN 
TGTNNTTTANAAAAAAAACCGGGCTACCATACNNACCAACCCNCNNACCTACCCNACC 
TTTCCANNTTAAAANAAGGCTTTGNCTTGCANAGGAAAANCAAAATNNCC 

Ref 1.2 

Sequence of BAC26 using primer HC2AS2, which spans nucleotides 327-346 of the cDNA. Exon 
sequence is underlined and represents nucleotides 351-375. 

TCTGGTTTCTACAGTGTATACTNAG GTATGTGCTCCTTNAACAAAATTAGCAGTTGCT 
GCTCTGTGACAAAGTTTGCACCATTTTGCAAGAAGAAAAAAATCCTAATGTGTTATAT 
TACTATATTTTTACTCTATAGATCTTTTTCT 

ATATGTATTCATATAAATGACTAGTACAAGCATCA1TTTGCAACAGATTTCCCCTTTC 
ATTGGAGGATCITCTTGATGTTATTTGTACA 

AGGCTGGGTGTGGTGGCTCACACCTGTAATCCTAGCATTTTGGAGGCCAAGGTGGGC 

AGATCACTTTAGCCCAGGGGTTTGAGACCAGCCTGGCCAACATGGCAAAACCTTGTC 

TCTACAAAAATACAAAAATTATCCAGGCATGGTGATGTGTGCCTGTAGTCCCAGCTAC 

CTAGGAGGCTAGGGTAGGGGGATTGCAAGAGGCTNGGAGGTCAAGGCCCGCAGTGA 

GCCATGGTCATGTCACTGCACCCCCAGCCAGGGCCGACAGGAGCAAGACTNTTGTNT 

CAAAAAAAAACAGNAACCAACANCCAACAACAACAACNACCTTTCNGCAAAANAAGC 

TTG CTNC AANG AAACC AAAATGNCITCTTNIHrTTCCCCCN 

Ref 1.3 

Sequence of BAC26 using primer HC2AS2, which spans nucleotides 327-346 of the cDNA. Exon 
sequence is not found within this sequence. This sequence most likely represent intron sequence 
since this sequence matches the intron sequence found in the previous two BAC sequences. 
AGIWNNNCCCNCTACNCCACinrTTAACCTTTTGA 

CGCTCCTTCACATATTAGCAGTTGCTGCTCTGTGACATAGTTGCACCATTNTGCAAGA 

AGAAAAAATCCTAAGTGTNATATCACTATATNNNTACTCTATAGATCTTNTCTAAAGA 

AAGAAAGTCAACTGATGTGCTTATATGTATNCATATAAATGACTAGTACATGCATCAT 

TTTGCAACAGATNTCTCCTCACATTGGAGGATCTTCIWGANGNATTCGACACGATNAN 

TATTAGTCTNAATAAGATGANGCTGGTGTGGNGGTACACTGNATCTAGCATNTGGAN 

GCATGTGGCAGACACTTANCCNCGGTNGAGACAGCTGTCACTGNCNAACTGTCTCTN 

TAAANCAAANNCTCCGCNGGNGATGGGCTGAGCCAGTCCTAGNNGCTAGNTAGNGAT 
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GNNGAGNTGTNGCACGNCGAGNGAGCATGNTCTGTACTGACTCATCAGGCGNCNACA 

CGOTCTGTTCNAAAACATACCACACACACTCNCACCTNCGCAAAATTGCTCTNNAAAN 

ATGCTTNTTTCACACNGNTNCAATCNCTATATNNTCTTCTATTCTNCNACGTNTNATTA 

^ATCTTNCNCTGCANAACNATNCGNCCACCTNNA^ACCTTANGCTTNGTTTCACGC 

TTATAGCTCCCCTACACNTNNCAGCNNTTNCNNGTGAAGGGCCNCCCGAATCTACGA 

NCATACTCTCTCCGTATATNGCCTCGGTCANCGCCATCTGCTGTNTNCTCNTCNCTNG 

CiSfrrrTNANCNGTNCGCTATCTCTNNNCCGGATCCN^ 

AGCGTAANNTNTNCNCNCACTANTCACAACTTNTNCOT^AACTCTATCTNCTCCTCT 

CTACCACCTCACTTACTACCTNTTCACNCANTCTCCTTCNCrNTCCACTGATCTCCACA 

TAGCTGCTNTACTCGCCANTTTATCATATNCACACNCTCTACGCTNNNTNT 

Ref 2. 1 

Sequence of BAC4 using primer HC2S1, which spans nucleotides 1107-1126 of the cDNA. Exon 
sequence is underlined and represents nucleotides 1079-1097. 

rnr TGTATTNAAAGAGGGTCT GCAGGAAGAAGTGTGTAGTCATAAATACCTCACTGGA 

TATTTTATACAGGATTCTAAAAAACCTATTAGCAATAGTATGCTAGAAATAGTCATTA 

GCTTCTTGACCTTCTTAGAACTGCACACTCTATTGCACTGTACAGATTTCAGGATGGC 

TGCAGGGATTGATTTGAAAACTAAGGACACATTTCAATAAACAATGTCTTCAATTGAT 

TTTTAGGGCTCCTCCTACTTCAATGAAGGACITCAGGTAGCrTATAATTACAGACACA 

GGCTCAATACAATAAAAAAATTAGTAAGGCAGAGCTTTAAAAAAAAAAAAGGAAAAA 

GATAATTCTACCAGAGAAAGGCTACATGGTGACTTCTGTTACCAGTAACAACCCCCG 

CACTACCirrGGGTCTCCAGGAGCAAAACAGCTAATGTAGTTGTTGATCTGCTTGAAG 

ACAAAGCCCCTGTCCATGAAGGTGAAACATCTCTGTGGAGGAAAACAAGCAAAAAAG 

TTATTTCAGGTCCAAACATTTCGGAAATTTGGATTCAAAGCAGGCATTTATTGCTAAT 

AAGTTTATCCACTGACATAAAAAACATGCCTTCAACATTGCCAGAGCACCTACTCTAT 

TNTAGTCNCN 

Ref 3-1 , „ 

Sequence of BAC4 using primer C96AS, which spans nucleotides 1443-1452 of the cDNA. Exon 

seauence is underlined and represents nucleotides 1370-1422. 

AATCAGCAGACCAAACAGAflGCAGGTAG AGGGTGGCTATCCITGCCTGATGGCTgT G 

. . . . ~ * . ^ . ^ » ^, . ^, , ^^.^ « i ni-rxr a rrr* nfiATTrTfiACA ArrfiAACTAAGTTGG 


AAAAGAAGACACACATGGTAAGTTTGACCCAGGATTCTGAGAACCGAACTAAGTTGG 
TGCTGACCATCTCCTTTATTTGGATCCTTCCTATAAAGACAGATATTTGATTTTAGTCC 
CAAAATAGAGCAAAATCTTAGTGCTGTTACCATGAATTTTCTAACTGATTACTTTCTTT 
ACACCACTTAAAATAAAGGACATTATCAATGCACATTCCTTCCATTGGGGACCACTCA 
CCCTTGAAGCATATCTGTCATCAAAAGAATGCTTTATCAGCAGGTTCTTGAGCACACT 
GATGGCGATCAGACGGACCTCCCGGAACTCCTGGAGGGCTGTCCCCACCTCCCTNAG 
TAACAGTCCCACCAAGAAGTGGTTTCTGCAGAACTCATCTGTTAATGAGTAGTCAAGC 
TGGGAGGTCTGAAATGAGGATAGAAACTACTTTGNGTTAGGAAAGATGCAATGCTCT 
TTTGAATAAAACAAACAAACCAAACNAACAAAAAAAAAACTAAGACCCATCCTTNTGN 
ATTTCAAGCCCACCCTGGGGTOGGTCAAAGAGATGATCAGNANTTTGGCNTTWAAAT 
GAAGAAAGAAATOAATTNTCCAGGGGNTGTTCmCTTTTTAGCACANGGAGGGATNT 

TAANTGAAAACCAATTTAAATCCAATTNAGGNG 
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Ref 4.1 

Sequence of BAC4 using primer C2AS5, which spans nucleotides 1716-1735 of the cDNA. Exon 
sequence is underlined and represents nucleotides 1602-1703. 

TTCClTTCTGCAAGGCTGTTCCCGAATCTGTGCTTATGAGAGATCCTCTCGAATrAOr 

ATTTCTCACACTGTTGATGTTTGGAGTTGAGGTTGTATATGGAGAAGC TAAATGGAAA 

TCAAGCCAACAATAAAGTTTTATTAuAGACAGAACAAAATAAAGATGAGTACTGAACTT 

TAAGGGAAATTGCT1TTATTGCACTTATTTTTTCTGTTAGGAAGTTGGCTCAAGA 

GCATTCCATTACTTCACCTTTAAAGAACCAGGTCATATACAATGAGATAAAAAGAAAC 

TAGTCTGAAACATTCAGATGTAAACATGAATTCACTTGTTAGAAACCACCTTTGATCG 

CTAAAGACTAAATGCATACCTGTTTCAGAATGTGATAGAATGAAGACTTAAAAAAATT 

AAAAGATAAATCCACCTACAACTATCAAATCACAAAATTAAACCACACAACAAACTTG 

TAGCATTCAAACTGGTAATAAACACTGAGGAGCCTACCCAACTCTGAGGGGTGTCAT 

GGGGTATTTTAAATTTTCGAGGAGAACACAGTGATATGTGACCTCAGCCAGAAGCTG 

CTGT1TNAGCAGCAGGTTGGTGC 

GTATTTTGGGGGGCCTGCTTATGATAAAANGGCAAGGTTSriTCAAT 
Ref 4.2 

Sequence of BAC26 using primer C2AS5, which spans nucleotides 1716-1735 of the cDNA. Exon 
sequence Is underlined and represents nucleotides 1602-1703. 

TTCCTTTCTGGAAGGCTGTTACCCGAATCTGTGCTTATGAGAGATCCTCTCGAATCAG 
CATTTCTCACACTGTTGATGTTTGGAGTTGAGGTTGTATATGGAGAAGCT AAATGGAA 
ATCAuAGCCAACAATAAAGTTTTTATTAAGACAGAACAAAATAAAGATGAGTACTGAACT 
T^AAGGGAAATTGCTTTTATTGCACTTATTTTT^ 

TGCATTCCATTACTTCACCTTTAAAGAACCAGGTCATATACAATGAGATAuAAAAGAAA 

CTAGTCTGAAACATTCAGATGTAAACATCAATTCACTTGTTAGAAACCACCTTTGATC 

GCTAAAGACTAAATGCATACCTGTTTCAGAATGTGATAGAATGAAGACTTAAAAAAAT 

TAAAAGATAAATCCACCTACAACTATCAAATCACAAAATTAAACCNCACAACAAACTT 

GTAGCATTCAAACTGGTAATAAAACACTGAGGAGCCTACCCAACTTTGAGGGGTGTC 

AATGGGGTNTTTTTAAATTTTTCGNGGGA^^ 

AGAAGCTTGTTTGTTTNACCAAGCNAGGTTC 

TTTTNNNAAATNCTGGNTTTTTT^^ 

Ref 5.1 

Sequence of BAC4 using primer C2S6, which spans nucleotides 1686-1705 of the cDNA. Exon 
sequence is underlined and represents nucleotides 1724-1736. 

TTCCTGGATAAG GTAATTGCTTTTACCCAACACAAATGTTTCTTATAATCAATGGATT 

TAGCCCAAAGTAAACGTACTTCATGTTCTAGTGCCTTTTAAGTGTGACCTTTTGTTTT 

TTTCTAAACCACCCGGCTGACCTGGAGTAGGTGATGAGAGCTTTAAGGTTGGGGCCC 

ATTCCTTGAAGTGCTCTGATTCCTGTTTCCAGTACCTCAGATCCTGGGCAGGGTTTGC 

AGTGGAGCGTCTTGAGTGAATGGCTCTGGTGGGTTGAACGGGGAGGGACTCAAAAT 

GCTGCCCATCTCAATTTCCTGTAGTCTTTTTATTT 

TCTCGCTCTGTCGCCCAGGCTGGAGTACAGCGGCACGATCTCAATTNACTGCAACCT 

CCGCCTCC:TGGGTTCAAACGACTCCTCTGCCTCAGCCTCCCCAGCAGC:TGGGACCA 

CAGGCACAAGCCACCACCGCCCGGCTAATTTTTTGTNTTTTTAGTA:GAGAT:GGGGTT 

TCACCATATTTGGCCAGGCTGGGCTCAAACTCCTGACC:TCGTCATCCGCNCCCTCGG 

NCTNCCAAAGTGCTTGGGATTNCAGGCNGTGAGCCCACTTACACCTNGGGCAATTCC 

CTGTNAGTCTTTTTTACCAGAGACACCATCATTCAACACAGCITTTCCACCCACAA 


FIG. 6A (cont.) 


i 


1 


WO 02/31 1 17 PCT7US01/32202 

55/114 


Ref 5.2 

Sequence of BAC26 using primer C2S6, which spans nucleotides 1686-1705 of the cDNA. Exon 
sequence is underlined and represents nucleotides 1712-1736. 

tA a o A A G A GC A ATTTCCTGG ATAAG GTAATTGCTTTTACCCAACACAAATGTTTCTTA 

TAATCAATGGATTTAGCCCAAAGTAAACGTACTTCATGTTCTAGTGCCTTTTAAGTGT 

CACCTTTTGTTTTTTTCTAAACCACCCGGCTGACCTGGAGTAGGTGATGAGAGCTTTA 

AGGTTGGGGCCCATTCCTTGAAGTGCTCTGATTCCTGTTTCCAGTACCTCAGATCCTG 

GGCAGGGTTTGCAGTGGAGCGTCTTGAGTGAATGGCTCTGGTGGGTTGAACGGGGA 

GGGACTCAAAATGCTGCCCATCTCAATTTCCTGTAGTCTTTTTATTTATTTATTTATTT 

TTTGAGACAGAGTCTCGCTCTGTCGCCCAGGCTGGAGTACAGCGGCACGATCTCAAT 

TCACTGCAACCTCCGNCTCCCTGGGTTCAAACGACTCCTCTGNCTNAGNCTCCC:AGC 

AGCCTGGGAACCACAGGCTCANGCCACCACGCCCGGCTAATTNTTGTAATTTTNAGT 

AANAAATTGGGGGTTCTCACCATOTTGGCCCAAGNCTTGGGCCTAAAAACCTTNCTNA 

CCNTCGNCATTCNCNCCCCNACCNTGGGCNCTNCTCAAANGNGCTTGGGGATTTANC 

ANNGGCNTTAACCCCCCNTATCACCGTGGNCCTTAATTT 

Sequence of BAC4 using primer C2S7, which spans nucleotides 1918-1937 of the cDNA. Exon 
sequence is not found within this sequence.Since the primer is directed against exon sequence we 

presume that sequence derived from C2S7 is intron sequence. 

NAGNGNGGGTTTNAGNCGTTTGAAGCCTGNNACGNGGTGNGTGCTNGAACTCTGTGG 


CAGGGCTITTTCCTCCCTGTAGCTGCTACTTATACACATAGCTCTAACTGAGATGATT 
CTCCAGACAACTGATGCAGAGCAGCAAAAGCTTCTGCCGTTCTCCCCTTCTAGGAGT 
GTCTCCTTTCTTTGGAAAGAGATCATGAGGGGCTAGATTGTAATGAAGTGAGGCTCA 
GTGCTTGAGCACATCCGGTAAAAGTTCCAATATATTGGTCATAAAGTTTCTCATTCTT 
TATAGCAGTTAATTTCTCTGGCTCATGAGTTTTCTTAGTTTTAATCTGACTTTTAAATT 
AATGTCTCCAGCACCAGTCATATCCCCAGGGCAAACTCAAAGGCATGAGAGGCCAGA 
CTCGGGTCCTGGTCATAGCAACCCCTGTCTAGGGCCTTGGTCCCTGCCTCCGCTTGT 
GTGCTGTGGCGCAGGTCCTATGGGCCCTTAGGAAACAGGACCACCCTGTCGCACCCC 

CTACAGAGACCAGCCAAGTTTGACATTAGATCACCGTAGCAATGTNTGCAAAT^ 
GTTTCTTGCTAAAACAGGTTAAGCCTTGCAGCCACTTTATCTGTAACTGGCNGAGGTT 

TTGACATAAAA 

Sequence of BAC4 using primer C2S8, which spans nucleotides 2143-2162 of the cDNA. Exon 

sequence is underlined and represents nucleotides 2182-2219. „-™-.,~r/-i»r 
GTCTCGACACr.rTnTTTGTATTAACATTGGCGTTTAA GGTTTGTATCAATTTGCTG^rr 

CGNGGTTCTAGTTTTACC1TTCACATTCATTCTGCTTGGTAAGCTCAGTGAGCACAAA 


CTTACTATGTTGCATTTTTACTICAGCAATTATTTTTGTCCCTGTAAGGAAACCATTAA 



AAATTTAACTGGTCAGTGTGTGCTGCGCGGTTGGGTATGGTGGAAAC^ACGGTCTCTG 
GAGGCAGTTAACTCTTGGCTCGAACCTTGAGGATGGTGAATATAGGCACCTAATCAG 
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GCATTTCTGCCTTGAATATCTTTAAATATATCCAAATGTTATAGCGTTTAATTAGATTT 

TTATGTAGAAAGGAGCAATAAACACAAGACACATGTTTTCAGTTTTTTATCT 

GCATTAAATGATAAAAACGTTTTGGAGATAGAAAATGAAAGGGGTTTTTT^ 

TGTTTTAAAGTTTTAGCAAATAATATTCAAGTAGGTGGAGATGGACTCTTCACCACTC 

TCCTGTTTTTAGGAACCCAATACTTTTTCATTCTTGCrAAATGATT 

GCATAGAAAAGGAGAAAATTGGAATGAGTGTTTATAT 

RefS.l 

Sequence of BAC4 using primer C2S9, which spans nucleotides 2992-3011 of the cDNA. Exon 
sequence is not found within this sequence. Since the primer is directed against exon sequence we 
presume that sequence derived from C2S9 is intron sequence 

CGC1TTNAAATNCCAGCCGCTACTGCGGGGCGNTNAATTCGAAACGTGTTGTTNTCT 

GTGATGCCTGGCTCTGATTGTGTGGGATTGGTCATCAGTGGCGGTTGGCAGNTGGGG 

TTCATGGAAGCGGCCATGGGGACTGATGGCAGGCCCTTGGATTGCCACCGCAGAGCC 

TGGCAGTGTCTTTGGTCTGCATTCCTACCGGCGAAGTCTCATTTCACCTCACGTGTTA 

TCTCTTGGAAAGCATTCCTTTAGCGGGeTGTGTCTACCCTTCCATCCTCTCGTCCAAA 

CTCCCCCTCCTTCTCTGTTCTGTCTCCTTCCCATCCTCTTCTCCCCAGTTCTTCTTCCT 

ATG1TCCTTCCTCAGTGGTTTCTC1TCCTCTGTTTGACTTTCCAAGGTCATTTTGACTG 

TTCCTGCTCCCAACTACAAAGATACTAAAATCTCACCTAACCACTCTTCTTCTTTCTTA 

ATGAAAGAATGTTTTCAGTCCATCCCAAATTTGTGTGGAC^ 

AATG G AGCCTTTTCTCTI CUT ACTCTTG ACT AGNTGGT AAACGCTCC ATGTTCTTGGC 

CAGAACTCCCTGGTGAGTAGCGTCACTCCCACTTTCCTGTGCAGAACCAAGCCTCCT 

AGAAAACTCCTTTGCANCTGAGTGGGTTGGGACACGCCCTTTNTTTGGG 

Ref9.1 

Sequence of BAC4 using primer C2AS10, which spans nucleotides 3276-3295 of the cDNA. Exon 
sequence is underlined and represents nucleotides 3147-3234. 

TTTANACCNAT>n , ATCCGNGTCAGTTANAGGAGTCTCTGAGAAATTTCCGACAGCGGT 

GTGAGTTTGGGTTCCTTGTAAATATACTCCTTTCCATCTTCATCTTCAAAGAATCCCT 

GTGACATAAAGCACAATTAGAGCTATCCCTGAACGTAAGCCCAGGGCTTACCACCTA 

GGAAGCGTTCTTTTATTACAAGGGGGAAAAAAAGGAATGGGTCTAAAAATCCAGCTG 

AAATG G GCTTTCTG AATG AG AAAGAAAATGCT AAT AAC ATG AAGTCT AGGTGCAAAG 

GTAAAGGAAAAACACAACATTGCAAACTTATTCAAGAATGCAGTCATTAAGTGTTGAG 

TG AAATG AAAGATTTTG G AT AC AAG ACT AAG CTGTCCC AGGG AAGTCTAATGGG AGT 

CAAGCCTGTTTCACTTTCCCAAGAAGCAGAACTCACTANAAAATGATGAGCAGCCCA 

CGACAGGCAGGCTCAGAAGTGGACATGCCTCCCTTCTCCTGATGGCTNCCATGCACA 

CAGGATTTTATGGCATGAACTGAAGCGTTTGGGGGTCTGGAGTAAGTTTAGTAAAAG 

TTAGGTAAAGCTTGTATAAATTGTATTTCT 

AAGACCTGGTAGCTTCAATATTCAAGAAAAATATTTTTCATNTCACCCG 


FIG. 6A (cont.) 
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Ref 10.1 

Sequence of BAC4 using primer C2S11, which spans nucleotides 3167-3186 of the cDNA. Exon 
sequence is underlined and represents nucleotides 3231-3296. 

HCN_'M > T / i ! 'I Y:!r r A rfi a NCCA.cn~iCl A f! A ATfTTN A A CCTNCTTAAACTGTAC TCGGATN 

AATTTGGTTCTGAAAATGTCAAAATGiATACAGGATTCTGGCAAG GTATTGACCATGTT 

TGGANAAGTTTCATAGCAATGTAATGTTGTGATNCGATTACATATNATATATTTTTAA 

ATGTNTATAGAAAAAAACACANGAAAAATATTAAGGATTGTTGGCCCGTGAGTGGCA 

GGTGTATNTTCTTNCTGATCCTTTAGNGCTTTCCATTACATGCNTGACATTAAAAAAA 

NCTTTATCGCCrAATTTTTGAAACATCTAATTTTACAAAATAATTAACCGTNTGGCCAN 

GNATATTNTCATTTTTAGGNCCAGCTATTTAGAAACTCTGACANAAATGAGGGGCTGT 

GGCTTNCCTNCCTNNACTTGNCCCTCTTTCNNGNATGTACCACATGAACTTGNCNCCT 

C7ITTCNNCTNACCGGGTGGCATGTTANAGGACAGGTTGAAACCNCANTNGGGCNGGA 

NTTNGGTNNAATTGGGACACAATGGTACNANGCTCTATNGGAATNGAAACTCTCCCN 

ACNNNCNGTGNNCCNTGGGGAAAATGNGNCNNATTCATTTTN 

Ref 11.1 

Sequence of BAC4 using primer C2S12, -which spans nucleotides 3474-3493 of the cDNA. Exon 
sequence is not found within this sequence. Since the primer is directed against exon sequence we 
presume that sequence derived from C2S9 is intron sequence 

AGNANNGTTNNGCAGCTGCANNTCTGGACCCANAGGCCGCANGGGCACGAGCCNGG 

ACACGCTCGGCAAAGAGCTGTCCAGAGGGATTCAGAAGCTTCAGGACTGGAAGGGTC 

TTTCGAGCrCAGTTAGCCACCCCCACACCCATTTCAGTTTCACATTTATCTAGTGCTT 

CCTTTTGAATACTTGGGATGTTTTTCTGTTGATCTGTTGGCACTTCCTTCTTCCACAA 

GACCAGAAGCTCATATCCAATCTAAGGTCACTTACCCTTCTGAGAATCTGATGAAAAT 

GGCGTGCCTTATGTGCCTAGATGCTTTTGCACACAGTCTAAGGTGACTTATGGACTCC 

AGGTCCAGCAGCCACACCCAGTCCTGGGTCTCCGCACAGGGAGGGACCCGTCTTCAC 

ACACCTGTCTCAGGTTCTAGCATTGGGCTGCTTCAGCGGTCTCAGGCTGTGAGTAAA 

TGGGATGTGAGCTTGGATCGCCCCACGCTGTTGNCCCCCGGGGGGCTTGGCCAGCTG 

GCCACTTNGAAATGCCTCCTTTTGCCCAGGAAAGCTCACTGCATTTCAATGGGGNTTN 

TCCACGAAGTTCANCTTTANGGG 
Ref 12.1 

Sequence of BAC4 using primer C2S13, which spans nucleotides 3645-3664 of the cDNA. Exon 
sequence is underlined and represents nucleotides 3683-3699. 

a <tni a a (ZdTNNrrr A NT1ST A AN NC A GCGTGAGNGTTCAGGT GAGCCAGGCAC AGCAGGC 

CGGAGGGCAGCAGGGGACGTCCTTGCCCCTGGGTGACTTGAGAGTCGTTTCCACTAA 

CAAGGTCTACTTGAGAGCCTCGGTTTACCAAGTGATCCCTGCTCCCTTCCCCCAACGT 

NTGTGACATTTCTCCTGATATCAGAGGGGGAGGAAACCTCATGATCCCTGCCCCCCG 

CCCCATGAGGACTGACTGTGGGGACAAAGAGCCAGATCTCATAGACTACCCTGATTT 

GTCAGTATTTGGGGAATTCTGGGTGCCTGATTAGAAGCATCAAGACTCTTCTAAATNC 

AAAGAAGTGTGGAGAGCAGTAGATTTTCCTATAAAACTGGTGTTGCTGGTTTCTATGA 

AAATTGGATCCAAAAAAAGTCCTTAAGTTTACCCTCTTAATGGNATCTTTTGATTAAT 

GGAATTCATTATTTTAATATAGCCCAATCAATCCAATTTTTCTTTATTGGTAGCATTTT 

TATGTTCTCTTTAAAAAAATCTTGGNCTACCTCCAAAATTTCACAGATGTTCTCCTAG 

GGTTTTCCTCCTTTTGGTTCAAGCATCCCATTCAANGTCTTGCAGTCCATTCTGGGG 
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Ref 13.1 

Sequence of BAC4 using primer C2S14, which spans nucleotides 4289-4308 of the cDNA. Exon 
sequence is underlined and represents nucleotides 4321-4448. 

GACTTANATTTATTCTTCCTTGCAGAGTAGTGTTAGAATAGATGGCCTACAGAAAAAA 

AAGGTTCTGGGATCTACATGGCAGGGAGGGCTGCACTGACATTGATGCCTGGGGGAC 

CTTTTGCCTCGA GGCTGAGCTGGAAAATCTTGAAAATATTTTTTTTTTCCTGTGGCAC 

ATTCAGGTTGAATACAAGAACTATTTTTGTGACTATGTTTTTGATGACCTAAGGGAAC 

TGACCATTGTAATTTTTGTACCAJ^TGAACC^ 

CCTTGCATTTAAGAAAATATGAAAGCTTAAGGAATTATGTGAGCTTAAAACTAGTCAA 
GCANTlTAGAACCAAAGGCCTATISrraVATAACCGCAAC^ 
AGTACAGNATATTGNTATGTACATATCATTTG^^ 
TATATGTATTACATTTCTACFTTT^ 

NTTGNGATAAATTTNGGCTGTTCCNGTNTATNCNATACNCTTTT 


Ref 14.1 

Sequence of BAC4 using primer C2AS15, which spans nucleotides 4680-4700 of the cDNA. Exon 
sequence is underlined and represents nucleotides 4660-4683. 

ATGAGAATGTAA TACATATATGTACAGAATGCCAGGACTGTATTAACAATGATATGTA 

CATAACAATATACTGTACTACTTTGTACTTTTCAGCATAGTTGCGGTTATTAATATAG 

GCCTTTGGTTCTAAACTGCITGACTAGTTTTAAGCTCACATAATTCCTTAA 

ATITTCTTAA^TGCAAGGAAATGAATATAAAAGCACTAAATCTCCTGGTTCACTGGTA 

CAAAAATTACAATGGTCAGTTCCCTTAGGTCATCAAAAACTAGTCACAAAAATAGTTC 

TTGTATTCAACCTGAATGTGCCACAGGAAAAAAAAAATATTTTCAAGAT^ 

CAGCCTCGAGGCAAAAGGCCCCCAGGCATCAATGTCAGNGCAGCCCTCCTGCCATGT 

AGATCCCAGAACCITTTTTTTCTGTAGGCCATCTATTCTAACACTACTCTGCAGGGAG 

AATAAAATCTAAAGNCCAGCTCAAGAGTGCTACCACACCITTGTTAAGACACAATGAA 

AACTTTGGATATTGGCAGGNGAGATTTAAAJLAAAAATGTGCCCTTTCTTACCACTCCT 

ATAGNAAAGTCTGGTTAAGAAATAACCGTTGGTCTTTATTTTCCTT^ 

CCTTGGGNCTTCCTGGGGCTCGG 
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ASGNLDKNARFS AI YRQD SNKLSNDDMLKIiIADFRKPEKMAKLFVILG^rLD I TIDNVS SD 

FPNYVNSSYIPTKQFETCSKTPITFEVEEEVPCIPKHTQPYTIYTNHIiYVYPKiIiKYDSQ 

VLHHHQNPEFYDE IK 

KSFAKAI^IAICIEFKDSDEEDSQPIiKCIYGRPGGPVFTRSAFAAVIiHHHQNPEFYDEIK 

IELPTQLHEKHHiaiLTFFHVSCDNSSKGSTKKRDVVETQVGYS 

IELPTQIiHEKHHIililiTFFHVS CDNS SKGSTKKRDWE TQVGYSWLPLLKDGRVVTSEQHI 

PVS ANIiPSGYIiGYQELGMGRHYGPE IKWVDGGKPIJLKI S THLVS TVYTQDQHLHNFFQYC 
PVS ANIiPSGYLGYQELGMGRHYGPE I KWVDGGKPLI*KI STHLVS TVYTQDQHLHNFFQYC 

GPGPARS TVS I SLI SNSARV 

QKTTESGAQAIiGNEIjVKYIiKSLHAMEGHVMIAFIjPTIIiNQL 
QKTESGAQALGOTIiVKYIiKSLHAMEGHVM^ 

MEIQVLIRFLSVIIiMQLFWVIiPNMIHEDDVPISCPMV 

MSFLPI I LNQLFKVLV- QNEE DE I TT TVTRV 

NRSRSLSNSNPDISGTPTSPDDEVRSIIGSKGIiDRSNSWVNTGGPKAAPWGSNPSPSAES 
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IIHWAQCHSEGTESHLRSYVKYAY^ 

IIHVVAQCHZEGLESHIJlSYVTOfAYKAEPYVASEYKWHEELT^ 

LFH IVSKCHEEGIiDSYLS SFIKY SFRPGKP SAPQAPIilHETIiATMMIALLKQSADFLAIH 

LPDrVAKCHEEQIiDHSVQSYIKFVFKTR ACKERPVHEDLAKNVTGI*LK-SNDSPTVK 

TQAMDRS CNRMS S HTE TS SFLQTLTGRLP TKKLFHEE LALQWWC S G — SVR E 

Cadherin 
Cl eavag e 
KLLRYSWFFFDVIjIKSMAQHLIENSIO/K^^ 

EQjLKY SWFFBT5VL I KSMAQHLIEN S KVKLI RNQhFPAS YHHAVE TWNMLMPHI TQKFRD 


Ref. 


KIXKYSWFFFEIIAKSMATYTJLEENKIKX r ] HGQP FPKAYHHALHSLFIiAI T - IVESQYAE 

HVLKki _ . 

SALQQAWFFFEIldjrKSMVHHIjY^NDKIiEAI ] 


SWFFFAI I LKSMAQHLIDTNKIQL^RPQP FPE SYQNEIiDNX,VMVljSDHVrWKYKD 

RKSF FPERFMDD IAALVST IASD IVSRFQK 


6.1 

1.2/1.2/2.1/ 
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NPEASKNANHSIiAVFIKRCFTFMDRGFVFKQIN NY IS — CFAPGDI^KTLFEYKFEFL 

NPEA3KNANHSLAVFIKRCFTFMDRGFVFKQIN NY IS — CFAPCD PKTLFE YKFE FL. 

IPKE SRNWYS1ASF1JCCCLTLMDRGFVFN1.IN DYIS — GFS PKD PKVIiAE YKFE FL 

AI£ETRRATHSVARFLK<lCFTFMDRGCVrFKMVN NYIS — MFSSGDLKTLCQYKFDFL, 

DTEMVERL^WSIAFFIiNDIiSVblDRGFvPSL 
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RWCNHEHYIPLNLPM- 
RWCNHEHY2 PLNLPM- 


PFGKGRIQR- 
•PFGKGRIQR- 


■YQDIxQL DYSLTDEF 

-YQDLQI* DYSLTDEF 


QTICNHEHYIPLNLPM AFAKPKLQR VODSNL EYSLSDEY 

QEVCQHEHFIPLCLPIRSANIPDPLTPSES TQELHASDMPEYSVTNEF 

RIICSHEHYVTIxtUJCSIiTPPASPSPSVSSATlSQSSGFSTNVQDQKIANMFELS — VPF 
MNADTAPTS PCPS IS SQNS S SC S SFQDQKIASMFDRTSRVPA 
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CRNHFLVGHJiREVGTALQEFRE VRLIAISVIiKNIiLIKHSFDDRYASPjSHQARIAT 3 . 1 

CRNH FLVGIililiREVGTALQEFRE VRLIAX SVLKNIjLIKH S FDDRY ASRS HQ ARIAT 

CKHHFLVGLLLRETSIALQDNYE IRYTAI SVTKNLLIKHAFDTRYQHKNQQAKI AQ 

CRKHFLlfelliliREVGFALQEDQD VRHIiAIAVLKNLMAKIi S FDDRYRE PRKQAQ IAS 6 . 1 

RQO HYIiAGLVLTEIiAVIIjD PD AE GLFGIiHKKVTNMVHNIiLSSHD SDPRY S DP Q I KARVAM 
S S TS - SPGLLFrEIiAAAIDAEGE GISEVQRKAVSAIHSIXSSHDIJ5PRCVKPEVKVTCIAA 
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LYIjPLFGLLIEWQRINVRDVSPFFVKAG-MTVKDESIJ^^ 

LYLPLFGLLIEHVQRINVRDVS PFPVNAG -MTVKDE SLALPAVNPLVTPQKG STLDNSLK 

LYLPFVGLLI^NIQRLAGRDTIiYSCAAMPNSASRDEFPCG FTSP — AN— RGSLS 

XYMPLYGbfljLDNMPRIYIJCDLYPFTVNTSNQGSRDDLST^ 

LY1 J PLIG2IME , IVPQLYDFTETHNQRCRPICIATDDYESE SG SMIS 

LYLPLVG 1 1 LDALPQICD FTVAD TKRYR TSGSDEEQE GA GAIT 

4.1/4.2 , 
KDLLGAI S G iXs P Y TT S T PN INSVRNAD SRG SL I S TD S GNSL PE RNS EKSNSLDKHQQ S S 
KDLLGAI S GIAS PYTTS TPNINSVRNAD SRGSLI S TD SGNSLPERNS EKSNSLDKHQQS S 

TDKDTAYG S FQNG HGIKREDSRGSLIP-EGATGFPDQGNTGEN TRQS 

KDVXNSIAkFSS lAISTVNHADSRASIJ^IXlSNPSTOEKSSEKTDNCEKIPRPL 

QTVAMAIAGTSVPQ LTRPGSFXLTSTSGRQHT 

QNVALAIACNNFN -liKTSG-IVLS^LPYKQYN 
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Ref . 

TLGNSWROTKI^QSEIKSl^CFLYIIiKSMSDD^ 

tlgnsvvrcdkijdqseiksli^cfly^ 

STRSSVSQYNRIJ>QY13IRi3IiLMCYLYrVKMISED 

AlilGSTLRFDRIiDQAETRSIiLMCFLHIi^TlSYETL 11 . 1/11 .2 

TFSAESSl^IJiICIiWLKN-ADETVLQKWF 

MLNADTTRNIiMICFLWIMKN-ADQSLIRKV^ 


HC2A 

KIAA 

rat 

HC4 

HC1 

HC3 

HC5 


HQF|qYMGKRYIARNQEGIjG~ FIVHDRKS ■ 
HQFQYMGKRYIAR 


QTLPVSRNRTGMM 6 . 1 
TGMM 


FHFRYMGKHNXARVHDAWIiSKHFGIDRKS QTMPAIiKNRSGVM 

QNFRYI.GKRNIIRKIAAAF — KFVQSTQl^GTLKGSNPSCQTSGIJliAQWMHSTSRHEGHK 

SCFEYKGKKVFERMNSIiTFK — KSKDMRAK I*EE AILGS IGARQEMV 

1XFEYKGKQSSDKVSTQVLQ — KSRDVKAR LEEAI1I1RGE GARGEMM 
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HARLQQL- 
HARLQQL- 


•GSIiDNS- 
-GSLDNS- 


■IiTFNHSYGHSDADVIiHQSIiIiEANIATEVC 
-LTFNHSYGHSDADVXHQSLLEANIATEVC 


QARLQHL SSLESS FTLNHSSTTTE^JDIFHQALLEGNTATEVS 

QHRSQTLPIIRGK NALSNPKL LQMLDNTMT|3NSNEIDIVHHVDTEANIATEGC 

RRSRGQI^RSPSGSAFGSQENLRWRKDOTHVTRQNra 

RRRAPGNDRFP GI*NENI^WKKEQTHV^QANEKIJ3KrKAEl4DQElAIjI S GNLATEAH 
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I/TAIX^LSLFTIAFK^QI*!^^ 7 * X 

LTAIjDTLSLFTIJVFKNQLIADHGHNPLMKKVFDVYX 

KLSRGHSPIWKKVFDVYl^FX^KHQSEMAI.KNVF 

LTVIJDTISFFTQCFKTHFLNNDGHNPLMKKVro 

IiTIIiDLVSLFTQTHQRQLQQCXJCONSIiMKRGFDTYMIiFFQW I 13 . 1 

LIILDTLEIWQTVS — VTES — KESILGGVIiKVXjLHSMACNQSAVYIiQHCFATQRALVS 
LIIH^EMIQASS«AII)C--KDSIJ.GGVI^^ 3 . 1 
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KFPSTFYEGRADMCAALCYEIIJCCCNSKLSSIRTE^ 

KFP STFYEGRADMCAALCYE I LKCCNS KL SSI RTEASQLLYFI24RNNFD Y TGKKS FVRTH 
KFP S TFYEGRADMCASIiC YEVIiKCCNSKLS S IRTEASQLLYFIMRNNFDYTGKKSFVRTH 
KF P S AF FKGRVNMCAAFC YEVLKCC T S KI S S TRNEAS AUliYXIMRNNFEY T KRKT FLRT H 
KFPSAFFQGPADIiCGSFCYEVIjKCOniRSRSTQT IVRSH 
K^PEIJ.FEEETEQCADIXZIJUJ^CSSSIGTIRSHPSASLYI.IMRQNFEIGN — NFAKVK 7 . 1/7 . 2 
KFGDI*LFEEEVEQCFDI*CHQVLHHCSSSMDVTRSQACATLYLI24RFSFGATS NFAKVK 
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LQVIISVSQLIADVVGIGETRFQQSIiSIINfclCANSDRIjIKHTSFSSDVKDIiTKRIRTVlM 
LQVIISVSQLIADWGIGGTRFQQSLSIimC^ 

LQVTISLSQLIADVVGIGGTRFQQSLSIINNCANSDRLIKHTSFSSDVKDIjTKRIRTVIM 
IjQI I IAVSQIiXADVALSGGSRFQESLETCINNFAN SDR^MIiARAFPAEVKDIjTKRIRTVIiM 
LQ)LIKAVSQLIAD-AGIGGSRFQHSIiAITNNFAN^ 

MQVTMSIASLVGRAPDFNEEIILRRSI^TIIAYSEEDTAMQMTPFPT 
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Transmembrane 


Ref . 


HCSA 
KIAA 
rat 

HC j 


HCT A 
KL 
rat 

t-:c4 
hc: 
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ATAQMKEHENDPEMLVDLQYSLAKSYASTPELRKTWLDSMARIHVKN^ 
ATAQMKEHENDPEMLVDLQYSIJtfCSYASTPEiLIUCTW^ 

ATAQHKEHEtTOPEMLVDLQYSIiAKSYASTPEIiRKTWLDSMARIHVKNGE LSEAAMCYVHV 
ATA3MKE HSKDPEMLIDLQYSLAKSYASTPELRKTWLDSMAXIHVKNGE FSEAAMCYVHV 
ATA QMKE U 2 KDPEMLVDLQYS LAH S YAS TPELRRTWLE SMAX I HARNGI L S e)aAMCY I H I 
I TVKMKT. H JFXJPEMLIDLMYRIAKGYQTS PDLRLTWLQNMAGKHSERS* HAE AAQCLVHS 
r TWKSS. r>X PEM-MDLMYRI AXSYQAS PDLRLTWLQNMAEKHTKKKC YTEAAMCLVHA 


SH3 


Sal vax y Urwccv- - 

TALVAE YljrKKFJV- - 
iTAiVAXYLJPJ'XAD- 
tAAL VAX FLHF JOCL- 


AALVAKYL *<U.D- 
AJU.VATYL, a^LXD- 


FRQGCTAFRVI TPN 
■ VQWEP PUjPH s h saclrrs RGGftfFRQGCTAFRVI TPN 
■LAIiQREPPVFPYSHTSCQRKSRGCflFRQGCTAFRVI TPN 

:EPNGCSAFKK! TPN 



RK fliPVGCVTF^NISSN 

HS rifPVGSVS FQN ISSN 
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I Dt EAS>MkXVC^lQD 

I DE EASM-2X- VGMQD 

I OF. tAS>f^ECVGMQD 

TDEEGAMKEtAGMMD 

I KE KGAAKEDSGMHD T PYNEjtJ I LVE QLYMCGE FL.WKS E RYE LIADVNKP I IAV 

VLX k S AVST DWS PDEEGI CSGKYFTE S GLVGLLE QAAAS FSMAGMYEAVNEVYKVL I P I 
V1XESWSEDTLS PDEDGVCAGQYFTE S CLVGLLE QAAE LFSTGGLYE TVNE VYKLVIPI 


- VHFNEDVLME LLE Q CAD GLWKAERYELIAD I YKX X I PI 
-VHFNEDVLME LLE QCADGLWKAERYELI AD IYKLI I PI 
-VHFlTODVLMEIiEQCADGLWKAERIiRAGLLTSINSSSP 
-VHYSEEVIiIiEI*LEQCVNGLWKAERYE 1 1 SE I SKLI GPI 
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KRAYSKV FEVMH 


yxkrJo 

TtKRRDFERIiAHliYDTL 

SMKSGGTLE TTHI lYDTLtTEtEfeSKVtrEVI TR 


YEHRREFENLTQ\ YRTL iGA^TKI LEVMHTKKRLLG- 
FEKQRDFKKLSDI YYDI IRSYLKV. aEWNSEKRLFG- 
KEAKRDAKEXSTJ HGKLJEAPSKT /HQjSTGWERMFG- 
LXAimEFRKLTLTI SKLQ XAE DSXV *EDH — KRMFC- - 


ESGRRLLGT YFRvj vFFGQAAQYQFTD SE TDVE 

A AGSWDLLPGGLFGQ 

TFFRVAFYGQ 


-RYYRVfcFYGQ 
— TiTFRV 3FYG- 
•TYFRVG FFG- 


-FFEDEDGKEYIYKEPKLTPLSEISQRLLICLYSDIQr^ 

CFFEDEDGKXYIYKEPKLTPLSEISQRLLKLYSDKFGSENMKMIQDSGK^ 
CPFEDEDGKXYIYKEPKLTPLSEISQRIJ^KLYSDKFGS 

SFFEEEDGKEYIYKEPKLTGLSEI SLRLVKLYGEKFGTENVKI IQDSDKVNAKELDPKYA 
GFTTLEEEGKEYTYKEPKLTGLSE1 sqrllklyadkfgadnvki IQDSNKVNPKDLDPKYA 
TKFCDII3EQEFVYKEPAITKXAEISHRl{EGFYGERFGEDVVEV^ 

3 KFGDLD E QE FVYKE PAI TKLPE I S HRLEAFYGQC FGAE FVE VT KD S T PVDkJtKLOPNKA 
YIO\mfVT?FFDEKELQERKTEFERSHNIRRFMFI^ 

Y I QVTUVI ? FFDE KELQE RKTE FERS HNIRRFMFEMPFTQTGKRQGGVEE QC KRRT I LTA 

YIQWHVTPFFDEKELOERKTEFERCm^RRFMFEMPFTQTGKRQGG^ 

HIQVTYVXPYFDDKELTERKTEFERNHNISRFVFEAPY^SGK^ 

YIQVTYVTPFFEEKEIEDRKTDFEMHHNIira 

YIQITYVEPYFDTYEMKDRITYFDKi^YNliRRFMYCTPF 

YIQITFVEPYFDEYEMKDRVTYFEK^TFbOLiRRFMYTTPFTIXGRP 
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1 AATTGTAATA CGACTCACTA TAGG GCGAAT TGGGTACCGG 
81 GAATTCGGCA CGAGTTTTAC ACCATCACCA AAACCCAGAA 
161 ATGAAAAGCA CCACCIGtUG CTCACATTCT TCCATGTCAG 
241 GTCGTTGAAA CCCAAGTTGG CTACTCCTGG CTTCCCCTCC 
321 GGTCTCGGCG AACCTTCCTT CGGGCTATCT TGGCTACCAA 
401 GGGTAGATGG AGGCAAGCCA CTGCTGAAAA TTTCCACTCA 
481 ATTTTTTCCA GTACTGTCAG AAAACCGAAT CTGGAGCCCA 
561 CATGCGATGG AAGGCCACGT GATGATCGCC T7CTTGCCCA 
641 ACAGGAAGAA GTCGCGGTTA ACGTGACTCG GGTCATXATT 
721 ACTTGAGGTC ATATGTTAAG TACGCGTATA AGGCTGAGCC 
801 ACCAAATCCA TGACCACGAT TCTCAAGCCT TCTGCCGATT 
881 CTTTGATGTA CTGATCAAAT CTATGGCTCA GCATTTGATA 
961 CTGCATCCTA TCATCATCCA GCGGAAACCG TTGTAAATAT 
1041 GAGGCATCTA AGAACGCGAA TCATAGCCTT GCTGTCTTCA 
1121 CAAGCAGATC AACAACTACA TTAGCTGTTT TGCTCCTGGA 
1201 GTGTAGTGTG CAACCATGAA CATT AT ATT C CGTTGAACTT 
1281 GACCTCCAGC TTGACTACTC ATTAACAGAT GAGTTCTGCA 
1361 GACAGCCCTC CAGGAGTTCC GGGAGGTCCG TCTGATCGCC 
1441 ATGACAGATA TGGTTCAAGG AGCCATCAGG CAAGGATAGC 
1521 GTCCAGCGGA TCAATGTGAG GGATGTGTCA CCCTTCCCTG 
1601 ACCAGCTGTG AATCCGCTGG TGACGCCGCA GAAGGGAAGC 
1681 TCTCCGGCAT TGCTTCTCCA TATACAACCT CAACTCCAAA 
1761 ATAAGCACAG ATTCGGGTAA CAGCCTTCCA GAAAGGAATA 
1841 CACATTGGGA AATrCCGTGG TTCGCTGTGA TAAACTTGAC 
1921 TCTTAAAGAG CATGTCTGAT GATGCTTTGT TTACATATTG 
2001 ATATCTGAAG TCTGCCTGCA CCAGTTCCAG TACATGGGGA 
2081 AGTTCAT GAT CGAAAGTCTC AGACATTGCC TGTTTCCCG1 
2161 GCAGCCTGGA TAACTCTCTC ACTTTTAACC ACAGCTATGG 
2241 GCCAACATTG CTACTGAGGT TTGCCTGACA GCTCTGGACA 
2321 GCCCGACCAT GGACATAATC CTCTCATCAA AAAAGTTTTT 
2401 CGGG TTTAAA AAATGTCTTC ACTGCCTTAA GGTCCTTAAT 
2481 ATGTGTGCGG CTCTGTGTTA CGAGATTCTC AAGTGCTGTA 
2561 GCTCTACTTC CTGATTGAGGA ACAACTTTGA TTACACTGGA 
2 641 CTGTCAGCCA GCTGATAGCA GACGTTGTTG GCATTGGGGA 
2721 GCCAACAGTG ACCGGCTTAT TAAGCACACC AGCTTCTCCT 
2801 AATGGCCACC GCCCAGATGA AGGAGCATGA GAACGACCCA 
2881 ATGCCAGCAC GCCCGAGCTC AGGAAGACGT GGC TCGACAG 
2961 GCAGCAATGT GCTATGTCCA CGTAACAGCC CTAGTGGCAG 
3041 CGCCTTCAGG GTCATTACCC CAAACATCGA CCAGCAGGCC 
3121 ACCAGGATGT GCTGATGGAC CTCCTTGAGC AGTGCGCAGA 
3201 ATCTAGAAAC TTAXCATCCC CATTTATGAG AAGCGGAGGG 
3281 GGAACCCAAA CTCACACCGC TGTCGGAAAX TTCTCAGAGA 
3361 TCAAAATGAT ACAGGATTCT GGCAAGGTCA ACCCTAAGGA 
3441 ATCCCCITCT TTGACGAAAA AGAGTTGCAA GAAAGGAAAA 
3521 TGAGATGCCA TTTACGCAGA CCGGGAAGAG GCAGGGCGGG 
3601 TACACTGCTT CCCTTATG TG AAGAAGCGCA TCCCTGTCAT 
3681 ATTGACGAGA TGAGTAAGAA GGTGGCGGAG CTCCGGCAGC 
3761 CAAACTCCAG GGCAGCGTGA GTGTTCAGGT CAATGCTGGC 
3841 CAAAGCGAXA TCCTGACAAT AAAGTGAAGC TGCTTAAGGA 
3921 GCGGTAAACG AACGTCTGAT TAAAGAAGAC CAGCTCGAGT 
4001 GGAGCTTTCT GAAAXCATGC ATGAGCAGAT CTGCCCCCTG 
4081 rCAACGCCAT CAGTGGGACT CCAACAAGCA CAATGGTTCA 
4161 ATCCCCCGTC TGTCGGGACT TGCTTTGTCA TTTGCAAACT 
4241 CAGGGAGGAC CAAGGGGAAG GGGAGAGAAA GGAAATAAAG 
4321 AGAAGGTGCA CATATTTTTT TAAATCTCAC TG3CAATATT 
4401 TCTTGAGCTG GACTTAGATT TTATTCTTCC TTGCAGAGTA 
4481 ATCTACATGG CAGGGAGGGC TGCACTGACA TTGATGCCTG 
4561 AATCAGGGTA CAGAACTTAC TAGTTTTGTC TAGGAGTATG 
4641 ATAGAGCAAG AAXAGTGAGC TAACTGAGCT AGACACTCAA 
4721 TCATCGACTC CGGGACGGTC ATATATGTAT TACATTTCTA 
4801 TGTGATAAAT TTGTGCTGGT CCAGTATATG CAATACACTT 
4981 TCTATACAAC TCTTTACT 

I 10 I 20 I 30 | 40 


| 50 | 60 | 70 | 80 

GCCCCCCCTC GAGGTCGACG GTATCGATAA GCTTGATATC 80 
TTTTATGATG AGATTAAAAT AGAGTTGCCC ACTCAGCTGC 160 
CTGTGACAAC TCAAGTAAAG GAAGCACGAA GAAGAGGGAT 240 
TGAAAGACGG AAGGGTGGTG ACAAGCGAGC AGCACATCCC 320 
GAGCTTGGGA TGGGCAGGCA TTATGGTCCG GAAATTAAAT 400 
TCTGGTTTCT ACAGGGATAC TCAGGATCAG CATTTACATA 480 
AGCCTTAGGA AACGAACTTG TAAAGTACCT TAAGAGTCTG 560 
CTATCCTAAA CCAGCTGTTC CGAGTCCTCA CCAGAGCCAC 640 
CATGTGGTTG CCCAGTGCCA TGAGGAAGGA TTGGAGAGCC 720 
ATATGTTGCC TCTGAATACA AGACAGTGCA TGAAGAACTG 800 
TCCTCACCAG CAACAAACTA CTGAGGTACT CATGGTTTTT 880 
GAGAACTCCA AAGTTAAGTT GCTGCGAAAC CAGAGATTTC 960 
GCTGATGCCA CACATCACTC AGAAGTTTGG AGATAATCCA 1040 
TCAAGAGATG TTTCACCTTC ATGGACAGGG GCTTTGTCTT 1120 
GACCCAAAGA CCCTCTTTGA ATACAAGTTT GAATTTCTCC 1200 
ACCAATGCCA TTTGGAAAAG _GCAGGATTCA AAGATACCAA 1280 
GAAACCACTT CTTGGTGGGA CTGTTACTGA GGGAGGTGGG 1360 
ATCAGTGTGC TCAAGAACCT GCTGATAAAG CATTCTTTTG 1440 
CACCCTCTAC CTGCCTCTGT TTGGTCTGCT GATTGAAAAC 1520 
TGAACGCGGG CATGACCGTG AAGGATGAAT CCCTGGCTCT 1600 
ACCCTGGACA ACAGCCTGCA CAAGGACCTC CTGGGCGCCA 1680 
CATCAACAGT GTGAGAAATG CTGATTCGAG AGGATCTCTC 1760 
GTGAGAAGAG CAATTCCCTG GATAAGCACC AACAAAGTAG 1840 
CAGTCTGAGA TTAAGAGCC T ACTGATGTGT TTCCTCTACA 1920 
GAACAAGGCT TCAACATCTG AACTTATGGA TTTTTTTACA 2000 
AGCGATACAT AGCCAGGAAC CAGGAGGGGT TGGGACCCAT 2080 
AACAGAACAG GAATGATGCA TGCCAGATTG CAGCAGCTGG 2160 
CCACTCGGAC GCAGATGTTC TGCACCAGTC ATTACTTGAA 2240 
CGCTTTCTCT ATTTACATTG GCGTTTAACA ACGAGCTCCT 2320 
GATGTCTACC TCTGTTTTCT TCAAAAACAT CACTCTCAAA 2400 
TTATAAGTTT CCCTCAACAT TCTATGAAGG GAGAGCGGAC 2480 
ACTCCAAGCT GAGCTC CATC AGGACGGAGG CCTCCCAGCT 2560 
AAGAAGTCCT TTGTCCGGAC ACATTTGCAA GTCATCAXAT 2640 
AACCAGATTC CAGCAGTCCC TGTCCATCAT CAACAACTGT 2720 
CTGATGTGAA GGACTTAACC AAAAGGATAC GCACGGTGCT 2800 
GAGATGCTGG TGGACCTCCA GTACAGC C TG GCCAAATCCT 2880 
CATGGCCAGG ATCCATGTCA AAAATGGCGA TCTCTCAGAG 2960 
AATATCTCAC ACGGAAAGGC GTGTTTAGAC AAGGATGCAC 3040 
TCCATGATGG AACACGTGGG GATGCAGGAT GTCCATTTCA 3120 
TGGACTCTGG AAAGCCGAGC GCTACGAGCT CATCGCCGAC 3200 
ATTTCTT TGA AGATGAAGAT GGAAAGGAGT ATATTTACAA 3280 
CTCCTTAAAC TGTACTCGGA TAAATTTGGT TCTGAAAATG 3360 
TCTGGATTCT AAGTATGCAT ACATCCAGGT GACTCACGTC 3440 
CAGAGTTTGA GAGATCCCAC AACATCCGCC GC TTCATGTT 3520 
GTGGAAGAGC AGTGCAAACG GCGCACCATC CTGACAGCCA 3600 
GTACCAGCAC CACACTGACC TGAACCCCAT CGAGGTGGCC 3680 
TGTGCTCCTC GGCCGAGGTG GACATGATCA AACTGCAGCT 3760 
CCACTAGCAT ATGCGCGAGC TTTCTTAGAT GATACAAACA 3840 
AGTTTTCAGG CAATTTGTGG AAGCTTGCGG TCAAGCCTTA 3920 
ATCAGGAAGA AATGAAAGCC AACTACAGGG AAATGGCGAA 4000 
GAGGAGAAGA CGAGCGTCTT ACCGAATTCC CTTCACATCT 4080 
CGGGATGACC AGCTCGTCTT CGGTCGTGTG ATTACATCTC 4160 
CACGATCCTT TCCAAACCCA ATCACTGGGG AGACCGAGCA 4240 
AACAACGTTA TTTCTTAACA GACTTTCTAT AGGAGTTGTA 4320 
CAAAGTTTrC ATTGTGTCTT AACAAAGGTG TGGTAGACAC 4400 
GTGTTAGAAT AGATGGCCTA CAGAAAAAAA AGGTTCTGGG 4480 
GGGGACCTTT TGCCTCGACT CGTGCCGGAA ATCTGATCGT 4560 
TTGTATGACT AGGATTTGTG CTATTATCTC ATTCAACAAC 4640 
TTAATCCGCT AC TGGC TTCA AGTCAGAACT TTGTCATTAA 4720 
CATTTTTAAT AC TCACATGG GCTTATGCAT TAAG T TTAAT 4800 
TAATG CTTTA TTCTTGTCAT AAAAATCTGC AATATCGAGA 4880 

4898 
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1 ME6HVMIAFL PTII*NQI*FRV LTRATQEEVA VNVTKVTIHV 
81 SMTTXLKPSA DFL.TSNKLLR YSWFFFDVL.I KSMAQHLXEN 
161 SKNANHSLAV FIKRCFTFMD RGEVFKQTNN YISCFAPGDP 
241 OLDYSLTDEF CRNHFLVGLL LREVGTALQE FREVRLIAIS 
321 RXNVRDVSPF PVNAGMTVKD ESIALPAVNP LVTPQKGSTL 
401 TDSGNSliPER NSEKSNSLDK HQQSSTLGNS WKCDKLDQS 
481 EVCLHQFQYM GKKTXABMQE GLGPXVHDRK 3QTLPVSRNR, 
561 lATEVCIiTAIi DTLSLFTLAF KNQUADHGH NPLMKKVFDV 
641 AALCYEILKC CNSKLSSIRT EASQLLYFLM RNNFDYTGKK 
721 SDRLIKHTSF SSDVKDLTKR XRTVIWA.TAQ MKEHENDPEM 
801 MCYVHVTATiV AEYLTRKGVF RQGCTAFRVI TPNIDEEASM 
881 KLIIPIYEKR RDFFEDEDGK EYIYKEPKLT PLSEISQKLL 
961 FFDEKELQER KTEFERSHNI RRJFMFEMPFT QTGKRQGGVE 
1041 EMSKKVAELR QLCSSAEVDM IKLQLKLQGS VSVQVNAGPL 
1121 NERLIKEDQL EYQEEMKANY REMAKELSEI MHSQICPLEE 
1 10 | 20 | 30 I 40 


| 50 | 60 | 70 I 80 

VAQCHEEGLE SffLRSYVKYA YKAEPYVASE YKWHEELTK 80 
SKVKLLRNQR FPASYHHAAE TWNMLMPHI TQKFGDNPEA 160 
KTLFEYKFES* LJCWCNHEHY IPLNLPMPFG KGRXQBYQDI* 240 
VLKNLL.IKHS FDDRYASRSH QARIATLYTiP LFGLLIENVQ 320 
DNSLHKDLLG AISGIASPYT TSTPMIHSVR NADSRGSIiIS 400 
EIKSIiLMCFXi YXLKSMSDDA LFTYWNKAST EEIMOFPTIS 480 
TGMMHABLQQ Z.GSU3NSLTP NHSYCHSDAD VLHQ2IXEAN 560 
YLCFLQKHQS ETAUCNVFTA LRSLIYKFPS TFYEGRADMC 640 
SFVRTHLQVT ISVSQLIADV VGIGETRFQQ SLSIINNCAN 720 
IiVDLQYSIAK SYA-STPEXiRK TWLDSMAR1B VKNGDLSEAA 800 
MEDVGMQDVH FNEDVIMEUL EQCADGXWKA ERYELIADIY 880 
KLYSDKFGSE NVKMIQDSGK VNPKDUDSKY AYIQVTHVTP 960 
EQCKRRTn/T AIHCFPYVKK RIPVMYQHHT DIOTPIEVAID 1040 
AYARAFIiDDT NTKRYPDNKV KT.T.KEVFRQF VEACGQA1AV 1120 
KTSVLPNSLH IFHAISGTPT STMVHGMTSS SSW 1195 
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I io 

• 1 AAT TGTAATA 
Bl GAATTCGGCA 
161 ATGAAAAGCA 
241 GTCGTTGAAA 
321 GGTCTCGGCG 
401 GGGTAGATGG 
481 ATTTTTTCCA 
561 CATGCGATGG 
641 ACAGGAAGAA 
721 ACTTGAGGTC 
901 ACCAAATCCA 
881 CTTTGATGTA 
961 CTGCATCCTA 
1041 GAGGCATCTA 
1121 CAAGCAGATC 
1201 GTGTAGTGTG 
1261 GACCTCCAGC 
1361 GACAGCCCTC 
1441 ATGACAGATA 
1521 GTCCAGCGGA 
1601 ACCAGCTGTG 
1681 TCTCCGGCAT 
1761 ATAAGCACAG 
1841 CACATTGGGA 
1921 TCTTAAAGAG 
2001 ATATCTGAAG 
2081 AGTTCATGAT 
2161 GCAGCCTGGA 
2241 GCCAACATTG 
2321 GGCCGACCAT 
2401 CGGCTTTAAA 
2481 ATGTGTGCGG 
2561 GCTCTACTTC 
2641 CTGTCAGCCA 
2721 GCCAACAGTG 
2801 AATGGCCACC 
2881 ATGCCAGCAC 
2961 GCAGCAATGT 
3041 CGCCTTCAGG 
3121 ACGAGGATGT 
3201 ATCTACAAAC 
3281 GGAACCCAAA 
3361 TCAAAATGAT 
3441 ATCCCCTTCT 
3521 TGAGATGCCA 
3601 TACACTGCTT 
3681 ATTGACGAGA 
3761 CAAACTCCAG 
3841 CAAAGCGATA 
3921 GCGGTAAACG 
4001 GGAGCTTTCT 
4081 TCAACGCCAT 
4161 ATGGCCCGTG 
4241 CAGGGAGGAC 
4321 AGAAGGTGCA 
4401 TCTTGAGCTG 
4481 ATCTACATGG 
4561 AATCAGGGTA 
4641 ATAGAGCAAG 
4721 TCATCGACTC 
4801 TGTGATAAAT 
4881 TGTATACAAG 
I 10 


| 20 
CGACTCACTA 
CGAGTTTTAC 
CCACCTGTTG 
CCCAAGTTGG 
AACCTTCCTT 
AGGCAAGCCA 
GTACTGTCAG 
AAGGCCACGT 
GTCGCGGTTA 
ATATGTTAAG 
TGACCACGAT 
CTGATCAAAT 
TCATCATGCA 
AGAACGCGAA 
AACAACTACA 
CAACCATGAA 
TTGACTACTC 
CAGGAGTTCC 
TGCTTCAAGG 
TCAATGTGAG 
AATCCGCTGG 
TGCTTCTCCA 
ATTCGGGTAA 
AATTCCGTGG 
CATGTCTGAT 
TCTGCCTGCA 
CGAAAGTCTC 
TAACTCTCTC 
CTACTGAGGT 
GGACATAATC 
AAATGTCTTC 
CTCTGTGTTA 
CTGATGAGGA 
GCTGATAGCA 
ACCGGCTTAT 
GCCCAGATGA 
GCCCCACCTC 
GCTATGTCCA 
GTCATTACCC 
GCTGATGGAG 
TTATCATCCC 
CTCACACCGC 
ACAGGATTCT 
TTGACGAAAA 
TTTACGCAGA 
CCCTTATGTG 
TGAGTAAGAA 
GGCAGCGTGA 
TCCTGACAAT 
AACGTCTGAT 
GAAATCATGC 
CAGTGGGACT 
TCTGGGGACT 
CAAGGGGAAG 
CATATTTTTT 
GACTTAGATT 
CAGGGAGGGC 
CAGAACTTAC 
AATAGTGAGC 
CGGGACGCTC 
TTGTGCTGGT 
TCTTTACT 

I 20 


I 30 
TAGGGCGAAT 
ACCATCACCA 
CTCACATTCT 
CTACTCCTGG 
CGGGCTATCT 
CTGCTGAAAA 
AAAACCGAAT 
GATGATCGCC 
ACGTGACTCG 
TACGCGTATA 
TCTCAAGCCT 
CTATGGCTCA 
GCGGAAACCG 
TCATAGCCTT 
TTAGCTGTTT 
CATTATATTC 
ATTAACAGAT 
GGGAGGTCCG 
AGCCATCAGG 
GGATGTGTCA 
TGACGCCGCA 
TATACAACCT 
CAGCCTTCCA 
TTCGCTGTGA 
GATGCTTTGT 
CCAGTTCCAG 
AGACATTGCC 
ACTTTTAACC 
TTGCCTGACA 
CTCTCATGAA 
ACTGCCTTAA 
CGAGATTCTC 
ACAACTTTGA 
GACGTTGTTG 
TAAGCACACC 
AGGAGCATGA 
AGGAAGACGT 
CGTAACAGCC 
CAAACATCGA 
CTCCTTGAGC 
CATTTATGAG 
TGTCGGAAAT 
GGCAAGGTCA 
AGAGTTGCAA 
CCGGGAAGAG 
AAGAAGCGCA 
GGTGGCGGAG 
GTGTTCAGGT 
AAAGTGAAGC 
TAAAGAAGAC 
ATGAGCAGAT 
CCAACAAGCA 
TGCTTTGTCA 
GGGAGAGAAA 
TAAATCTCAC 
TTATTCTTCC 
TGCACTGACA 
TAGTTTTGTC 
TAACTGAGCT 
ATATATGTAT 
CCAGTATATG 


f 40 
TGGGTACCGG 
AAACCCAGAA 
TCCATGTCAG 
CTTCCCCTCC 
TGGCTACCAA 
TTTCCAC TCA 
CTGGAGCCCA 
TTCTTGCCCA 
GGTCATTATT 
AGGCTGAGCC 
TCTCCCGATT 
GCATTTGATA 
TTGTAAATAT 
GCTGTCTTCA 
TGC TCCTGGA 
CGTTGAACTT 
GAGTTCTGCA 
TCTGATCGCC 
CAAGGATAGC 
CCCTTCCCTG 
GAAGGGAAGC 
CAACTCCAAA 
GAAAGGAATA 
TAAACTTGAC 
TTACATAT TG 
TACATGGGGA 
TGTTTCCCGT 
ACAGCTATGG 
GCTCTGGACA 
AAAAGTTTTT 
GGTCCTTAAT 
AAGTGCTGTA 
TTACACTGGA 
GCATTGGGGA 
AGCTTCTCCT 
GAACGACCCA 
GGCTCGACAG 
CTAGTGGCAG 
CGAGGAGGCC 
AGTGCGCAGA 
AAGCGGAGGG 
TTCTCAGAGA 
ACCCTAAGGA 
GAAAGGAAAA 
GCAGGGCGGG 
TCCCTGTCAT 
CTCCGGCAGC 
CAATGCTGGC 
TGCTTAAGGA 
CAGCTCGAGT 
CTGCCCCCTG 
CAATGGTTCA 
TTTGCAAACT 
GGAAATAAAG 
TGGCAATATT 
TTGCAGAGTA 
TTGATGCCTG 
TAGGAGTATG 
AGACACTCAA 
TACATTTCTA 
CAATACACTT 


I 50 
GCCCCCCCTC 
TTTTATGATG 
CTGTGACAAC 
TGAAAGACGG 
GAGCTTGGGA 
TCTGGTTTCT 
AGCCTTAGGA 
CTATCCTAAA 
CATGTGGTTG 
ATATGTTGCC 
TCCTCACCAG 
GAGAACTCCA 
GCTGATGCCA 
TCAAGAGATG 
GACCCAAAGA 
ACCAATGCCA 
GAAACCACTT 
ATCAGTGTGC 
CACCCTCTAC 
TGAACGCGGG 
ACCCTGGACA 
CATCAACAGT 
GTGAGAAGAG 
CAGTCTGAGA 
GAACAAGGCT 
AGCGATACAT 
AACAGAACAG 
CCACTCGGAC 
CGCTTTCTCT 
GATGTCTACC 
TTATAAGTTT 
ACTCCAAGCT 
AAGAAGTCCT 
AACCAGATTC 
CTGATGIGAA 
GAGATGCTGG 
CATGGCCAGG 
AATAXCTCAC 
TCCATGATGG 
TGGACTCTGG 
ATTTCTTTGA 
CTCCTTAAAC 
TCTGGATTC T 
CAGAGTTTGA 
GTGGAAGAGC 
GTACCAGCAC 
TGTGCTCCTC 
CCACTAGCAT 
AGTTTTCAGG 
ATGAGGAAGA 
GAGGAGAAGA 
CGGGATGACC 
CAGGATGCTT 
AACAACGTTA 
CAAAGTTTTC 
GTGTTAGAAT 
GGGGACCTTT 
TTGTATGACT 
TTAATCCGCT 
CATTTTTAAT 
TAATGGTTTA 


| 60 
GAGGTCGACG 
AGATTAAAAT 
TCAAGTAAAG 
AAGGGTGGTG 
TGGGCAGGCA 
ACAGGGATAC 
AACGAACTTG 
CCAGCTGTTC 
CCCAGTGCCA 
TC TGAATAGA 
CAACAAACTA 
AAGTTAAGTT 
CACATCACTC 
TTTCAC CTTC 
CCCTCTTTGA 
TTTGGAAAAG 
CTTGGTGGGA 
TCAAGAACCT 
CTGCCTCTGT 
CATGACCGTG 
ACAGCC TGCA 
GTGAGAAATG 
CAATTCCCTG 
TTAAGAGCCT 
TCAACATC TG 
AGCCAGGAAC 
GAATGATGCA 
GCAGATGTTC 
ATTTACATTG 
TGTGTTTTCT 
CCCTCAACAT 
GAGCTCCATC 
TTGTCCGGAC 
CAGCAGTCCC 
GGACTTAACC 
TGGACCTCCA 
ATCCATGTCA 
ACGGAAAGGC 
AAGACGTGGG 
AAAGCCGAGC 
AGATGAAGAT 
TGTACTCGGA 
AAGTATGCAT 
GAGATCCCAC 
AGTGCAAACG 
CACACTGACC 
GGCCGAGGTG 
ATGCGCGAGC 
CAATTTGTGG 
AATGAAAGCC 
CGAGCGTCTT 
AGCTCGTCTT 
TCCAAAGCCA 
TTTCTTAACA 
ATTGTGTCTT 
AGATGGCCTA 
TGCCTCGACT 
AGGATTTGTG 
ACTGGCTTCA 
ACTCACATGG 
TTCTTGTCAT 
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GTATCGATAA 
AGAGTTGCCC 
GAAGCACGAA 
ACAAGCGAGC 
TTATGGTCCG 
TCAGGATCAG 
TAAAGTACCT 
CGAGTCCTCA 
TGAGGAAGGA 
ACACAGTGCA 
CTGAGGTACT 
GCTGCGAAAC 
AGAAGTTTGG 
ATGGACAGGG 
ATACAAGTTT 
GCAGGATTCA 
CTGTTACTGA 
GCTGATAAAG 
TTGGTCTGCT 
AAGGATGAAT 
CAAGGACCTG 
CTGATTCGAG 
GATAAGCACC 
ACTGATGTGT 
AACTTATGGA 
CAGGAGGGGT 
TGCCAGATTG 
TGCACCAGTC 
GCGTTTAAGA 
TCAAAAACAT 
TCTATGAAGG 
AG GACGGAGG 
ACATTTGCAA 
TGTCCATCAT 
AAAAGGATAC 
GTACAGCCTG 
AAAATGGCGA 
GTGXTTAGAC 
GATGCAGGAT 
GCTACGAGCT 
GGAAAGGAGT 
XAAATTTGGT 
ACATCCAGGT 
AACATCCGCC 
GCGCACCATC 
TGAACCCCAT 
GACATGATCA 
TTTCTTAGAT 
AAGCTTGCGG 
AACTACAGGG 
ACCGAATTCC 
CGGTCGTGTG 
ATCACTGGGG 
GACTTTCTAT 
AACAAAGGTG 
CAGAAAAAAA 
CGTGCCGGAA 
CTATTATCTC 
AGTCAGAACT 
GC TTATGCAT 
AAAAATGTGC 


1 BO 
GCTTGATATC 80 
ACTCAGCTGC 160 
GAAGAGGGAT 240 
AGCACATCCC 320 
GAAATTAAAT 400 
CATTTACATA 480 
TAAGAGTCTG 560 
CCAGAGCCAC 640 
TTGGAGAGCC 720 
TGAAGAACTG 800 
CATGGTTTTT 880 
CAGAGATTTC 960 
AGATAATCCA 1040 
GCTTTGTCTT 1120 
GAATTTCTCC 1200 
AAGATACCAA 1280 
GGGAGGTGGG 1360 
CATTC TTTTG 1440 
GATTGAAAAC 1520 
CCCTGGCTCT 1600 
CTGGGCGCCA 1680 
AGGATCTCTC 1760 
AACAAAGTAG 1840 
TTCCTCTACA 1920 
TTTTTTTACA 2000 
TGGGACCCAT 2080 
CAGCAGCTGG 2160 
ATTACTTGAA 2240 
ACCAGCTCCT 2320 
CAGTCTGAAA 2400 
GAGAGCGGAC 2480 
CCTCCCAGCT 2560 
GTCATCATAT 2640 
CAACAACTGT 2720 
GCACGGTGCT 2800 
GCCAAATCCT 2880 
TC TC TCAGAG 2960 
AAGGATGCAC 3040 
GTCCATT TCA 3120 
CATCGCCGAC 3200 
ATATTTACAA 3280 
TCTGAAAATG 3360 
GACTCACGTC 3440 
GCTTCATGTT 3520 
CTGACAGCCA 3 600 
CGAGGTGGCC 3680 
AACTGCAGCT 3760 
GATACAAACA 3840 
TCAAGCCTTA 3920 
AAATGGCGAA 4000 
CTTCACATCT 4080 
ATTACATCTC 4160 
AGACCGAGCA 4240 
AGGAGTTGTA 4320 
TGGTAGACAC 4400 
AGGTTCTGGG 4480 
ATCTGATCGT 4560 
ATTCAACAAC 4640 
TTGTCATTAA 4720 
TAAGTTTAAT 4800 
AATATGGAGA 4880 
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1 MEGHVMIAFL PTILNQLFRV LTRATQEEVA VHVTRVIIHV 
81 SMTTXLKPSA DFLTSNKTiT.R YSWFFFDVLI KSMAQHLIEN 
161 SKNANHSIAV FIKRCFTFMD RGFVFKQINN YISCFAPGDP 
241 QLDYSI/TDEF CRNHFX.VGUL LREVGTALQE FREVRLIAIS 
321 RXNVRDVSPF PVNAGMTVKD ESIAI*PAVNP LVTPQKGSTL 
401 TDSGNSLPER NSEKSNSLDK HQQSSTLGNS WRCDKLDQS 
481 EVCLHQFQYM GKRYIARNQE GLGPTVHDRK SQTLPVSRNR 
561 IATEVCLTAL DTLSL.FTLAF KNQLLADHGH NPIiMKKVFDV 
641 AALCYEILKC CKSKLSSIRT EASQLLYFLM RHNFDYTGKK 
721 SDRXiIKHTSF SSDVKDLTKR IRTVXMATAQ MKEHEKDPEM 
801 MCYVHVTALV AEYI/TRKGVF RQGCTAFRVT TPNIDEEASM 
881 KLIIPIYEKR RDFFEDEDGK EYXYKEPKLT PLSEISQRLL 
961 FFDEKELQER KTE FERS HNI RRFMFEMPFT QTGKRQGGVE 
1041 EMSKKVAELR QLCSSAEVDM IKLQLKLQGS VSVQVNAGPL 
1121 NERLIKEDQL EYQEEMKANY REMAKELSEI MHEQICPLEE 
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| 50 | 60 | 70 | 80 

VAQCHEEGLE SHLRSYVKYA YKAEPYVASE YKTVHEELTK 80 
SKVKLLRNQR FPASYHHAAE TWNMLMPHI TQKFGDNPEA 160 
KTLFEYKFEF LRWCNHEHY IPLNLPMPFG KGRTQRYQDL 240 
VT.kwt.t.tkhr FDDRYASRSH QARIATLYXP LFGXXXENVQ 320 
DNSLHKDIxLG AISGXASPYT TSTPNINSVR NADSRGSLIS 400 
EIKSLXMCFX. YILKSMSDDA LFTYWNKAST SELMDFFTIS 480 
TGMMHAKLQQ LGSl*DNSLTF NHSYGHSDAD VLHQSLLEAN 560 
YLCFLQKHQS ETALKNVFTA LRSLIYKFPS TFYEGRADMC 640 
SFVRTHXiQVI ISVSQLIADV VGIGETRFQQ SLSIIKNCAN 720 
LVDLQYSIAK SYASTPELRK TWLDSMARIH VKNGDLSEAA 800 
MEDVGMQDVH FNEDVLMETJ, EQCADGLWKA ERYEL.IADIY 880 
KLYSDKFGSE NVKMIQDSGK VNPKDLDSKY AYIQVTHVIP 960 
EQCKRRTILT AIHCFPYVKK RIPVMYQHHT DLNPIEVAID 1040 
AYARAFXDDT NTKRYPDNKV KLLKEVFRQF VEACGQAIAV 1120 
KTSVLPNSLH IFNAISGTPT STMVHGMTSS SSW 1195 
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1 AATTCTAATA CGACTCACTA TAGGGCGAAT TGGGTACCGG 
81 GAATTCCGCA CGAGTTTTAC ACCATCACCA AAACCCAGAA 
161 ATGAAAACCA CCACCTGTTG CTCACATTCT TCCATGTCAG 
241 GTCGTTGAAA CCCAACTTGG CTACTCCTGG CTTCCCCTCC 
321 GGTCTCCGCC AACCTTCCTT CGGGCTATCT TGGCTACCAA 
401 GGGTAGATGC AGGCAACCCA CTGCTGAAAA TTTCCACTCA 
481 A TI 1 1 I I C CA CTACTCTCAG AAAACCGAAT CTGGAGCCCA 
561 CATGC GATCG AAGGCCACCT GATGATCGCC TTCTTGCCCA 
641 ACAGGAAGAA CTCGCGGTTA ACGTGACTCG GGT CAT TAT T 
721 ACTTGAGCTC ATATCTTAAC TACGCGTATA AGGCTGAGCC 
801 ACCAAATCCA TGACCACGAT TCTCAAGCCT TCTGCCGATT 
881 CTTTCATCTA CTGATCAAAT CTATGGCTCA GCATTTGATA 
961 CTGCATCCTA TCATCATGCA GCCGAAACCG TTGTAAATAT 
1041 GAGGCATCTA ACAACCCCAA TCATAGCCTT GCTGTCTTCA 
1121 CAAGCACATC AACAACTACA TTAGCTGTTT TGCTCCTGGA 
1201 GTGTAGTCTC CAACCATGAA CATTATATTC CGTTGAACTT 
1 281 GACCT CCAGC TTG ACTACT C ATTAACAGAT .GAGTTCTGCA 
1361 GACAGCCCTC CXGGACTTCC CGCACGTCCG TCTGATCGCC 
1441 ATGACACATA TGCTTCAAGC AGCCATCAGG CAAGGATAGC 
1521 GTCCAGCGGA TCAATCTGAG GGATGTGTCA CCCTTCCCTG 
1601 ACCAGCTCTC AATCCCCTGC TCACCCCGCA GAAGGGAAGC 
1681 TCTCCGGCAT TGCTTCTCCA TATACAACCT CAACTCCAAA 
1761 ATAAGCACAC ATTCGGGTAA CAGCCTTCCA GAAAGGAATA 
1841 CACATTGGGA AATTCCCTGG TTCGCTGTGA TAAACTTGAC 
1921 TCTTAAAGAC CATGTCTGAT GATGCTTTGT T TACAT ATTG 
2001 ATATCTGAAG TCTCCCTCCA CCAGTTCCAG TACATGGGGA 
2081 AGTTCATGAT CCAAACTCTC AGACATTGCC TGTTTCCCGT 
2161 GCAGCCTGGA TAACTCTCTC ACTTTTAACC ACAGCTATGG 
2241 GCCAACATTC CTACTGACCT TTGCCTGACA GCTCTGGACA 
2321 GGCC GACCAT GGACATAATC CTCTCATGAA AAAAGTTTTT 
2401 CGGCTTTAAA AAATCTCTTC ACTGCCTTAA GGTCCTTAAT 
2481 ATGTGTGCGG CTCTGTGTTA CGAGATTCTC AAGTGCTGTA 
2561 GCTCTACTTC CTGATGAGCA ACAACTTTGA TTACAC TGGA 
2641 CTGTCAGCCA CCTGATAGCA GACGTTGTTG GCATTGGGGA 
2721 GCCAACAGTG ACCGGCTTAT TAAGCACACC AGCTTCTCCT 
2801 AATGGCCACC GCCCAGATGA AGGAGCATGA GAACGACCCA 
2881 ATGCCAGCAC GCCCGAGCTC AGGAAGACGT GGCTCGACAG 
2961 GCAGCAATGT GCTATGTCCA CGTAACAGCC CTAGTGGCAG 
3041 CGCCTTCAGG GTCATTACCC CAAACATCGA CGAGGAGGCC 
3121 ACGAGGATGT GCTGATGGAG CTCCTTGAGC AGTGCGCAGA 
3201 ATCTACAAAC TTATCATCCC CATT TATGAG AAGCGGAGGG 
3281 GGAACCCAAA CTCACACCGC TGTCGGAAAT TTCTCAGAGA 
3361 TCAAAATGAT ACAGGATTCT GGCAAGGTCA ACCC TAAGGA 
3441 ATCCCCTTCT TTGACGAAAA AGAGTTGCAA GAAAGGAAAA 
3521 TGAGATGCCA TTTACGCAGA CCGGGAAGAG GCAGGGCGGG 
3601 TACACTGCTT CCCTTATGTG AAGAAGCGCA TCCCTGTCAT 
3681 ATTGAC GAGA TGAGTAAGAA GGTGGCGGAG CTCCGGCAGC 
3761 CAAAC TCCAG GGCAGCGTGA GTGTTCAGGT CAATGCTGGC 
3841 CAAAGCGATA TCCTGACAAT AAAGTGAAGC TGCT TAAGGA 
3921 GCGGTAAACG AACGTCTGAT TAAAGAAGAC CAGCTCGAGT 
4001 GGAGCTTTCT GAAATCATGC ATGAGCAGAT CTGCCCCCTG 
4081 TCAACGCCAT CAGTGGGACT CCAACAAGCA CAATGGTTCA 
4161 ATGGCCCGTG TGTGGGGACT TGCTTTGTCA TTTGCAAACT 
4241 CAGGGAGGAC CAAGGGGAAG GGGAGAGAAA GGAAATAAAG 
4321 AGAAGGTGCA CATATTTTTT TAAATCTCAC TGGCAATATT 
4401 TCTTGAGCTG GACTTAGATT TTATTCTTCC TTGCAGAGTA 
4481 ATCTACATGG CAGGGAGGGC T GCAC TGACA TTGATGC CTG 
4561 AATCAGGG T A CAGAACTTAC TAGTTTTGTC TAGGAGTATG 
4641 ATAGAGCAAG AATAGTGAGC TAACTGAGCT AGACACTCAA 
4721 TCATCGACTC CGGGACGGTC ATATATGTAT TACAT TTCTA 
4801 TGTGATAAAT TTGTGCTGGT CCAGTATATG CAATACACTT 
4881 TGTATACAAG TCTTTACT 
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GCCCCCCCTC GAGGTCGACG GTATCGATAA GCTTGATATC 80 
TTTTATGATG AGATTAAAAT AGAGTTGCCC ACTCAGCTGC 160 
CTGTGACAAC TCAAGTAAAG GAAGCACGAA GAAGAGGGAT 240 
TGAAAGACGG AAGGGTGGTG ACAAGCGAGC AGCACATCCC 320 
GAGCTTGGGA TGGGCAGGCA TTATGGTCCG GAAATTAAAT 400 
TCTGGTTTCT ACAGGGATAC TCAGGATCAG CATTTACATA 480 
AGCCTTAGGA AACGAACTTG TAAAGTACCT TAAGAGTCTG 560 
CTATCCTAAA CCAGC TGTTC CGAGTCCTCA CCAGAGCCAC 640 
CATGTGGTTG CCCAGTGCCA TGAGGAAGGA TTGGAGAGCC 720 
ATATGTTGCC TCTGAATACA AGACAGTGCA TGAAGAACTG 800 
TCCTCACCAG CAACAAACTA CTGAGGTACT CATGGTTTTT 880 
GAGAACTCCA AAGTTAAGTT GCTGCGAAAC CAGAGATTTC 960 
GCTGATGCCA CACATCACTC AGAAGTTTGG AGATAATCCA 1040 
TCAAGAGATG TTTCACCTTC ATGGACAGGG GCTTTGTCTT 1120 
GACCCAAAGA CCCTCTTTGA ATACAAGTTT GAATTTCTCC 1200 
ACCAATGCCA TTTGGAAAAG GCAGGATTCA AAGATACCAA 1280 
GAAACCACTT CTTGGTGGGA CTGTTACTGA GGGKCGTGGG -1360 
ATCAGTGTGC TCAAGAACC T GCTGATAAAG CATTCTTTTG 1440 
CACCCTCTAC CTGCCTCTGT TTGGTCTGCT GATTGAAAAC 1520 
TGAACGCGGG CATGACCGTG AAGGATGAAT CCCTGGCTCT 1600 
ACCCTGGACA ACAGCC TGCA CAAGGACCTG CTGGGCGCCA 1680 
CATCAACAGT GTGAGAAATG CTGATTCGAG AGGATCTCTC 1760 
GTGAGAAGAG CAATTCCCTG GATAAGCACC AACAAAGTAG 1840 
CAGTCTGAGA TTAAGAGCCT ACTGATGTGT TTCCTCTACA 1920 
GAACAAGGCT TCAACATCTG AACTTATGGA TTTTTTTACA 2000 
AGCGATACAT AGCCAGGAAC CAGGAGGGGT TGGGACCCAT 2080 
AACAGAACAG GAATGATGCA TGCCAGATTG CAGCAGCTGG 2160 
CCACTCGGAC GCAGATGTTC TGCACGAGTC ATTAC TTGAA 2240 
CGCTTTCTCT ATTTACATTG GCGTTTAAGA ACCAGCTCCT 2320 
GATGTCTACC TGTGTTTTCT TCAAAAACAT CAGTCTGAAA 2400 
TTATAAGTTT CCCTCAACAT TCTATGAAGG GAGAGCGGAC 2480 
ACTCCAAGCT GAGCTCCATC AGGACGGAGG CCTCCCAGCT 2560 
AAGAAGTCCT TTGTCCGGAC ACATT TGCAA GTCATCATAT 2640 
AACCAGATTC CAGCAGTCCC TGTCCATCAT CAACAACTGT 2720 
CTGATGTGAA GGACTTAACC AAAAGGATAC GCACGGTGC T 2800 
GAGATGCTGG TGGACCTCCA GTACAGCCTG GCCAAATCCT 2880 
CATGGCCAGG ATCCATGTCA AAAATGGCGA TCTCTCAGAG 2960 
AATATCTCAC ACGGAAAGGC GTGTTTAGAC AAGGATGCAC 3040 
TCCATGATGG AAGACGTGGG GATGCAGGAT GTCCATTTCA 3120 
TGGACTCTGG AAAGCCGAGC GCTACGAGCT CATCGCCGAC 3200 
ATTTCTTTGA AGATGAAGAT GGAAAGGAGT ATATTTACAA 3280 
CTCCTTAAAC TGTACTCGGA TAAATTTGGT TCTGAAAATG 3360 
TCTGGATTCT AAGTATGCAT ACATCCAGGT GACTCACGTC 3440 
CAGAGTTTGA GAGATCCCAC AACATCCGCC GCTTCATGTT 3520 
GTGGAAGAGC AGTGCAAAC G GCGCACCATC CTGACAGCCA 3600 
GTACCAGCAC CACACTGACC TGAACCCCAT CGAGGTGGCC 3680 
TGTGCTCCTC GGCCGAGGTG GACAT GATCA AACTGCAGCT 3760 
CCACTAGCAT ATGCGCGAGC TTTCTTAGAT GATACAAACA 3840 
AGTTTTCAGG CAATTTGTGG AAGCTTGCGG TCAAGCCTTA 3920 
ATCAGGAAGA AATGAAAGCC AACTACAGGG AAATGGCGAA 4000 
GAGGAGAAGA CGAGCGTCTT ACCGAATTCC CTTCACATCT 4080 
CGGGATGACC AGCTCGTCTT CGGTCGTGTG ATTACATCTC 4160 
CAGGATGCTT TCCAAAGCCA ATGACTGGGG AGACCGAGCA 4240 
AACAACGTTA TTTCTTAACA GACTTTC TAT AGGAGTTGTA 4320 
CAAAGTTTTC ATTGTGTCTT AACAAAGGTG TGGTAGACAC 4400 
GTGTTAGAAT AGATGGCCTA CAGAAAAAAA AGGTTCTGGG 4480 
GGGGACCTTT TGCCTCGACT CGTGCCGGAA ATCTGATCGT 4560 
TTGTATGACT AGGAT TTGTG CTATTATCTC ATTCAACAAC 4 640 
TTAATCC GC T ACTGGC TTCA AGTCAGAAC T TTGTCATTAA 4720 
CATTTTTAAT ACTCACATGG GCTTATGCAT TAAGTTTAAT 4800 
TAATGGT TTA TTCTTGTCAT AAAAATGTGC AATATGGAGA 4880 
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1 AATTGTAATA CGACTCACTA TAGGGCGAAT TGGGTACCGG 
81 GAATTCGGCA CGAGTTTTAC ACCATCACCA AAACCCAGAA 
161 ATGAAAAGCA CCACCTGTTG CTCACATTCT TCCATGTCAG 
241 GTCGTTGAAA CCCAAGTTGG CTACTCCTGG CTTCCCCTCC 
321 GGTCTCGGCG AACCTTCCTT CGGGCTATCT TGGCTACCAA 
401 GGGTAGATGG AGGCAAGCCA CTGCTGAAAA TTTCCACTCA 
481 ATTTTTTCCA GTACTGTCAG AAAACCGAAT CTGGAGCCCA 
561 CATGCGATGG AAGGCCACGT GATGATCGCC TTCTTGCCCA 
641 ACAGGAAGAA GTCGCGGTTA ACGTGACTCG GGTCATTATT 
721 ACTTGAGGTC ATATGTTAAG TACGCGTATA AGGCTGAGCC 
801 ACCAAATCCA TGACCACGAT TCTCAAGCCT TCTGCCGATT 
881 CTTTGATGTA CTGATCAAAT CTATGGCTCA GCATTTGATA 
961 CTGCATCCTA TCATCATGCA GCGGAAACCG TTGTAAATAT 
1041 GAGGCATCTA AGAACGCGAA TCATAGCCTT GCTGTCTTCA 
1121 CAAGCAGATC AACAACTAC A TTAGCTGTTT TGCTCCTGGA 
1201 GTGTAGTGTG CAACCATGAA CATTATATTC CGTTGAACTT 
1281 GACCTCCAGC TTGACTACTC AT TAACAGAT GAGTTCTGCA 
1361 GACAGCCCTC CAGGAGTTCC GGGAGGTCCG TCTGATCGCC 
1441 ATGACAGATA TGCTTCAAGG AGCCATCAGG CAAGGATAGC 
1521 GTCCAGCGGA TCAATGTGAG GGATGTGTCA CCCTTCCCTG 
1601 ACCAGCTGTG AATCCGCTGG TGACGCCGCA GAAGGGAAGC 
1681 TCTCCGGCAT TGCTTCTCCA TATACAACCT CAACTCCAAA 
1761 ATAAGCACAG ATTCGGGTAA CAGCCTTCCA GAAAGGAATA 
1841 CACATTGGGA AATTCCGTGG TTCGCTGTGA TAAACT T GAC 
1921 TCTTAAAGAG CATGTC TGAT GATGCTTTGT TTACATATTG 
2001 ATATCTGAAG TCTGCCTGCA CCAGTTCCAG TACATGGGGA 
2081 AGTTCATGAT CGAAAGTCTC AGACATTGCC TGTTTCCCGT 
2161 GCAGCCTGGA TAACTCTCTC ACTTTTAACC ACAGCTATGG 
2241 GCCAACATTG CTACTGAGGT TTGCCTGACA GCTCTGGACA 
2321 GGCCGACCAT GGACATAATC CTCTCATGAA AAAAGTTTTT 
2401 CGGCTTTAAA AAATGTCTTC ACTGCCTTAA GGTCCTTAAT 
2481 ATGTGTGCGG CTCTGTGTTA CGAGATTCTC AAGTGCTGTA 
2561 GCTCTACTTC CTGATGAGGA ACAAC TTTGA TTACACTGGA 
2641 CTGTCAGCCA GCTGATAGCA GACGTTGTTG GCATTGGGGA 
2721 GCCAACAGTG ACCGGCTTAT TAAGCACACC AGCTTCTCCT 
2801 AATGGCCACC GCCCAGATGA AGGAGCATGA GAACGACCCA 
2881 ATGCCAGCAC GCCCGAGCTC AGGAAGACGT GGCTCGACAG 
2961 GCAGCAATGT GCTATGTCCA CGTAACAGCC C TAG TGGCAG 
3041 CGCCTTCAGG GTCATTACCC CAAACATCGA CGAGGAGGCC 
3121 ACGAGGATGT GCTGATGGAG CTCCTTGAGC AGTGCGCAGA 
3201 ATCTACAAAC TTATCATCCC CATTTATGAG AAGCGGAGGG 
3281 GGAACCCAAA CTCACACCGC TGTCGGAAAT TTCTCAGAGA 
3361 TCAAAATGAT AGAGGATTCT GGCAAGGTCA ACCCTAAGGA 
3441 ATCCCCTTCT TTGACGAAAA AGAGTTGCAA GAAAGGAAAA 
3521 TGAGATGCCA TTTACGCAGA CCGGGAAGAG GCAGGGCGGG 
3601 TACACTGCTT CCCTTATGTG AAGAAGCGCA TCCCTGTCAT 
3681 ATTGACGAGA TGAGTAAGAA GGTGGCGGAG CTCCGGCAGC 
3761 CAAACTCCAG GGCAGCGTGA GTGTTCAGGT CAATGCTGGC 
3841 CAAAGCGATA TCCTGACAAT AAAGTGAAGC TGCTTAAGGA 
3921 GCGGTAAACG AACGTCTGAT TAAAGAAGAC CAGCTCGAGT 
4001 GGAGCTTTCT GAAATCATGC ATGAGCAGAT CTGCCCCCTG 
4081 TCAACGCCAT CAGTGGGACT CCAACAAGCA CAATGGTTCA 
4161 ATGGCCCGTG TGTGGGGACT TGCTTTGTCA TTTGCAAACT 
4241 CAGGGAGGAC CAAGGGGAAG GGGAGAGAAA GGAAATAAAG 
4321 AGAAGGTGCA CATATTTTTT TAAATCTCAC TGGCAATATT 
4401 TCTTGAGCTG GAC TTAGATT TTATTCTTCC TTGCAGAGTA 
4481 ATCTACATGG CAGGGAGGGC TGCACTGACA TTGATGCCTG 
4561 AATCAGGGTA CAGAACTTAC TAGTTTTGTC TAGGAGTATG 
4641 ATAGAGCAAG AATAGTGAGC TAACTGAGCT AGACACTCAA 
4721 TCATCGACTC CGGGACGGTC ATATATGTAT TACATTTCTA 
4801 TGTGATAAAT TTGTGCTGGT CCAGTATATG CAATACACTT 
4861 TGTATACAAG TCTTTACT 
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GCCCCCCCTC GAGGTCGACG GTATC GATAA GCTTGATATC 80 
TTTTATGATG AGATTAAAAT AGAGTTGCCC ACTCAGCTGC 160 
C TGTGACAAC TCAAGTAAAG GAAGCACGAA GAAGAGGGAT 240 
TGAAAGACGG AAGGGTGGTG ACAAGCGAGC AGCACATCCC 320 
GAGCTTGGGA TGGGCAGGCA TTATGGTCCG GAAATTAAAT 400 
TCTGGTTTCT ACAGGGATAC TCAGGATCAG CATT TACATA 480 
AGCCTTAGGA AACGAACTTG TAAAGTACCT TAAGAGTCTG 560 
CTATC CTAAA CCAGCTGTTC CGAGTCCTCA CCAGAGCCAC 640 
CATGTGGTTG CCCAGTGCCA TGAGGAAGGA TTGGAGAGCC 720 
ATATGTTGCC TCTGAATACA AGACAGTGCA TGAAGAACTG 800 
TCCTCACCAG CAACAAACTA CTGAGGTACT CATGGTTTTT 880 
GAGAACTCCA AAGTTAAGTT GCTGCGAAAC CAGAGATTTC 960 
GCTGATGCCA CACATCACTC AGAAGTTTGG AGATAATCCA 1040 
TCAAGAGATG TTTCACCTTC ATGGACAGGG GCTTTGTCTT 1120 
GACCCAAAGA CCCTCTTTGA ATACA AGTT T GAAT TTCTCC 1200 
ACCAATGCCA TTTGGAAAAG GCAGGATTCA AAGATACCAA 1280 
GAAACCACTT CTTGGTGGGA CTGTTACTGA GGGAGGTGGG 1360 
ATCAGTGTGC TCAAGAACCT GCTGATAAAG CATTCTTTTG 1440 
CACCCTCTAC CTGCCTCTGT TTGGTCTGCT GATTGAAAAC 1520 
TGAACGCGGG CATGACCGTG AAG GATGAAT CCCTGGCTCT 1600 
ACCCTGGACA ACAGCCTGCA CAAGGACC TG CTGGGCGCCA 1680 
CATCAACAGT GTGAGAAATG CTGATTCGAG AGGATCTCTC 1760 
GTGAGAAGAG CAATTCCCTG GATAAGCACC AACAAAGTAG 1840 
CAGTCTGAGA TTAAGAGCCT ACTGATGTGT TTCCTCTACA 1920 
GAACAAGGCT TCAAGATCTG AACTTATGGA TTTTTTTACA 2000 
AGCGATACAT AGCCAGGAAC CAGGAGGGGT TGGGACCCAT 2080 
AACAGAACAG GAATGATGCA TGCCAGATTG CAGCAGCTGG 2160 
CCACTCGGAC GCAGATGTTC TGCACCAGTC ATTACTTGAA 2240 
CGCTTTCTC T ATTTACATTG GCGTTTAAGA ACCAGCTCCT 2320 
GATGTCTACC TGTGTTTTCT TCAAAAACAT CAGTC TGAAA 2400 
TTATAAGTTT CCCTCAACAT TCTATGAAGG GAGAGCGGAC 2480 
AC TCCAAGCT GAGCTC CATC AGGACGGAGG CCTCCCAGCT 2560 
AAGAAGTCCT TTGTCCGGAC ACATTTGCAA GTCATCATAT 2640 
AACCAGATTC CAGCAGTCCC TGTCCATCAT CAACAACTGT 2720 
CTGATGTGAA GGACTTAACC AAAAGGATAC GCACGGTGCT 2800 
GAGAT GC TGG TGGACCTCCA GTACAGCCTG GCCAAATCCT 2880 
CATGGCCAGG ATC CATGTCA AAAATGGCGA TCTCTCAGAG 2960 
AATATCTCAC ACGGAAAGGC GTGTTTAGAC AAGGATGCAC 3040 
TCCATGATGG AAGACGTGGG GATGCAGGAT GTCCATTTCA 3120 
TGGACTCTGG AAAGCCGAGC GCTACGAGCT CATCGCCGAC 3200 
ATTTCTTTGA AGATGAAGAT GGAAAGGAGT ATATTTACAA 3280 
CTCCTTAAAC TGTACTCGGA TAAATTTGGT TCTGAAAATG 3360 
TCTGGATTCT AAGTATGCAT ACATCCAGGT GACTCACGTC 3440 
CAGAG TTTGA GAGATCCCAC AACATCCGCC GCTTCATGTT 3520 
GTGGAAGAGC AGTGCAAACG GCGCACCATC CTGACAGCCA 3600 
GTACCAGCAC CACACTGACC TGAACC C CAT CGAGGTGGCC 3680 
TGTGCTCCTC GGCCGAGGTG GACATGATCA AACTGCAGCT 3760 
CCACTAGCAT ATGCGCGAGC TTTCTTAGAT GATACAAACA 3840 
AGTTTTCAGG CAATTTGTGG AAGCTTGCGG TCAAGCCTTA 3920 
ATCAGGAAGA AATGAAAGCC AACTACAGGG AAATGGCGAA 4000 
GAGGAGAAGA CGAGCGTCTT ACCGAATTCC CTTCACATCT 4080 
CGGGATGACC AGCTCGTCTT CGGTCGTGTG AT TACATCTC 4160 
CAGGATGCTT TCCAAAGCCA ATCACTGGGG AGAC CGAGCA 4240 
AACAACGTTA TTTCTTAACA GACTTTCTAT AGGAGTTGTA 4320 
CAAAGTTTTC ATTGTGTCTT AACAAAGGTG TGGTAGACAC 4400 
GTGT TAGAAT AGATGGCC TA CAGAAAAAAA AGGTTCTGGG 4480 
GGGGACCTTT TGCCTCGACT CGTGCCGGAA ATCTGATCGT 4560 
TTGTATGACT AGGATTTGTG CTATTATCTC ATTCAACAAC 4 640 
TTAATCCGCT ACTGGCTTCA AGTCAGAACT TTGTCATTAA 4720 
CATTTTTAAT ACTCACATGG GCTTATGCAT TAAGTTTAAT 4800 
TAATGGTTTA TTCTTGTCAT AAAAATGTGC AATATGGAGA 4880 
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1 AATTGTAATA CGACTCACTA TAGGGCGAAT TGGGTACCGG 
81 GAATTCGGCA CGAGTTTTAC ACCATCACCA AAACCCAGAA 
161 ATGAAAAGCA CCACCTGTTG CTCACATTCT TCCATGTCAG 
241 GTCCTTGAAA CCCAAGTTGG CTACTCCTGG CTTCCCCTCC 
321 CGTCTCGGCG AACCTTCCTT CGGGCTATCT TGGCTACCAA 
401 GGCTACATGG AGGCAAGCCA CTGCTGAAAA TTTCCACTCA 
481 ATTTTTTCCA GTACTGTCAG AAAACCGAAT CTGGAGCCCA 
561 CATGCGATGG AAGGCCACGT GATGATCGCC TTCTTGCCCA 
641 ACAGGAAGAA GTCGCGGTTA ACGTGACTCG GGTCATTATT 
721 ACTTGACGTC ATATGTTAAG TACGCGTATA AGGCTGAGCC 
801 ACCAAATCCA TGACCACGAT TCTCAAGCCT TCTGCCGATT 
681 CTTTGATGTA CTGATCAAAT CTATGGCTCA GCATTTGATA 
961 CTCCATCCTA TCATCATGCA GCGGAAACCG TTGTAAATAT 
1041 GAGGCATCTA AGAACGCGAA TCATAGCCTT GCTGTCTTCA 
1121 CAAGCAGATC AACAACTACA TTAGCTGTTT TGCTCCTGGA 
1201 GTGTAGTGTG CAAC CATGAA CATTATA.TTC CGTTGAACTT 
1281 GACCTCCAGC TTGACTACTC AT TAACAGAT GAGTTCTGCA 
1361 GACAGCCCTC CAGGAGTTCC GGGAGGTCCG TCTGATCGCC 
1441 ATGACACATA TGCTTCAAGG AGCCATCAGG CAAGGATAGC 
1521 GTCCAGCGGA TCAATGTGAG GGATGTGTCA CCCTTCCCTG 
1601 ACCAGCTGTG AATCCGCTGG TGACGCCGCA GAAGGGAAGC 
1681 TCTCCGGCAT TGCTTCTCCA TATACAACCT CAACTCCAAA 
1761 ATAAGCACAG ATTCGGGTAA CAGCCTTCCA GAAAGGAATA 
1841 CACATTGGGA AATTCCGTGG TTCGCTGTGA TAAACTTGAC 
1921 TCTTAAAGAG CATGTCTGAT GATGCTTTGT TTACATATTG 
2001 ATATCTGAAG TCTGCCTGCA CCAGTTCCAG TACATGGGGA 
2081 AGTTCATGAT CGAAAGTCTC AGACATTGCC TGTTTCCCGT 
2161 GCAGCCTGGA TAACTCTCTC ACTTTTAACC ACAGCTATGG 
2241 GCCAACATTG CTACTGAGGT TTGCCTGACA GCTCTGGACA 
2321 GGCCGACCAT GGACATAATC CTCTCATGAA AAAAGTTTTT 
2401 CGGCTTTAAA AAATGTCTTC ACTGCCTTAA GGTCCTTAAT 
2481 ATGTGTGCGG CTCTGTGTTA CGAGATTCTC AAGTGCTGTA 
2561 GCTCTACTTC CTGATGAGGA ACAACTTTGA TTACACTGGA 
2641 CTGTCAGCCA GCTGATAGCA GACGTTGTTG GCATTGGGGA 
2721 GCCAACAGTG ACCGGCTTAT TAAGCACACC AGCTTCTCCT 
2801 AATGGCCACC GCCCAGATGA AGGAGCATGA GAACGACCCA 
2881 ATGCCAGCAC GCCCGAGCTC AGGAAGACGT GGCTCGACAG 
2961 GCAGCAATGT GCTATGTCCA CGTAACAGCC CTAGTGGCAG 
3041 CGCCTTCAGG GTCATTACCC CAAACATCGA CGAGGAGGCC 
3121 ACGAGGATGT GCTGATGGAG CTCCTTGAGC AGTGCGCAGA 
3201 ATCTACAAAC TTATCATCCC CAT T TAT GAG AAGCGGAGGG 
3281 GGAACCCAAA CTCACACCGC TGTCGGAAAT TTCTCAGAGA 
3361 TCAAAATGAT ACAGGATTCT GGCAAGGTCA AC C C TAAGGA 
3441 ATCCCCTTCT TTGACGAAAA AGAGTTGCAA GAAAGGAAAA 
3521 TGAGATGCCA TTTACGCAGA CCGGGAAGAG GCAGGGCGGG 
3601 TACACTGCTT CCCTTATGTG AAGAAGCGCA TCCCTGTCAT 
3681 ATTGACGAGA TGAGTAAGAA GGTGGCGGAG CTCCGGCAGC 
3761 CAAACTCCAG GGCAGCGTGA GTGTTCAGGT CAATGCTGGC 
3841 CAAAGCGATA T CC TGACAAT AAAGTGAAGC TGCTTAAGGA 
3921 GCGGTAAACG AACGTCTGAT TAAAGAAGAC CAGCTCGAGT 
4001 GGAGCTTTCT GAAATCATGC ATGAGCAGAT CTGCCCCCTG 
4081 TCAACGCCAT CAGTGGGACT CCAACAAGCA CAATGGTTCA 
4161 ATGGCCCGTG TGTGGGGACT TGCTTTGTCA TTTGCAAACT 
4241 CAGGGAGGAC CAAGGGGAAG GGGAGAGAAA GGAAATAAAG 
4321 AGAAGGTGCA CATATTTTTT TAAATCTCAC TGGCAATATT 
4401 TCTTGAGCTG GAC TTAGATT TTATTCTTCC TTGCAGAGTA 
4481 ATC TACATGG CAGGGAGGGC TGCACTGACA TTGATGCCTG 
4561 AATCAGGGTA CAGAACTTAC TAGTTTTGTC TAGGAGTATG 
4641 ATAGAGCAAG AATAGTGAGC TAACTGAGCT AGACACTCAA 
4721 TCATCGACTC CGGGACGGTC ATATATGTAT TACATTTCTA 
4801 TGTGATAAAT TTGTGCTGGT CCAGTATATG CAATACACTT 
4881 TGTATACAAG TCTTTACT 
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GCCCCCCCTC GAGGTCGACG GTATCGATAA GC T T GAT ATC 80 
TTTTATGATG AGATTAAAAT AGAGTTGC CC ACTCAGCTGC 160 
CTGTGACAAC TCAAGTAAAG GAAGCACGAA GAAGAGGGAT 240 
TGAAAGACGG AAGGGTGGTG ACAAGCGAGC AGCACATCCC 320 
GAGCTTGGGA TGGGCAGGCA TTATGGTCCG GAAATTAAAT 400 
TCTGGTTTCT ACAGGGATAC TCAGGATCAG CATTTACATA 480 
AGCCTTAGGA AACGAACTTG TAAAGTACCT TAAGAGTC TG • 560 
C TATC C T AAA CCAGCTGTTC CGAGTCCTCA CCAGAGCCAC 640 
CATGTGGTTG CCCAGTGCCA TGAGGAAGGA TTGGAGAGCC 720 
ATATGTTGCC TCTGAATACA AGACAGTGCA TGAAGAAC TG 800 
TCCTCACCAG CAACAAACTA CTGAGGTACT CATGGTTTTT 880 
GAGAACTCCA AAGTTAAGTT GCTGCGAAAC CAGAGATTTC 960 
GCTGATGCCA CACATCACTC AGAAGTTTGG AGATAATCCA 1040 
TCAAGAGATG TTTCACCTTC ATGGACAGGG GCTTTGTCTT 1120 
GACCCAAAGA CCCTCTTTGA ATACAAGTTT GAATTTC TCC 1200 
ACCAATGCCA TTTGGAAAAG GCAGGATTCA AAGAT AC CAA 1280 
GAAACCACTT CTTGGTGGGA CTGTTACTGA GG GAGGTGGG 1360 
ATCAGTGTGC TCAAGAACCT GCTGATAAAG CATTCTTTTG 1440 
CACCCTCTAC CTGCCTCTGT TTGGTCTGCT GATTGAAAAC 1520 
TGAACGCGGG CATGACCGTG AAGGATGAAT CCCTGGCTCT 1600 
ACCCTGGACA ACAGCCTGCA CAAGGACCTG CTGGGCGCCA 1680 
CATCAACAGT GTGAGAAATG CTGATTCGAG AGGATCTCTC 1760 
GTGAGAAGAG CAATTCCCTG GATAAGCACC AACAAAGTAG 1840 
CAGTCTGAGA TTAAGAGCCT AC TGATGTGT TTCCTCTACA 1920 
GAACAAGGCT TCAACATC TG AACTTATGGA TTTTTTTACA 2000 
AGCGATACAT AGCCAGGAAC CAGGAGGGGT TGGGACCCAT 2080 
AACAGAACAG GAATGATGCA TGCCAGATTG CAGCAGCTGG 2160 
CCACTCGGAC GCAGATGTTC TGCACCAGTC ATTAC TTGAA 2240 
CGCTTTCTCT ATTTACATTG GCGTTTAAGA ACCAGCTCCT 2320 
GATGTCTACC TGTGTTTTCT TCAAAAACAT CAGTCTGAAA 2400 
TTATAAGTTT CCCTCAACAT TCTATGAAGG GAGAGCGGAC 2480 
ACTCCAAGCT GAGCTCCATC AGGACGGAGG CCTCCCAGCT 2560 
AAGAAGTCCT TTGTCCGGAC ACATTTGCAA GTCATCATAT 2640 
AACCAGATTC CAGCAGTCCC TGTCCATCAT CAACAACTGT 2720 
CTGATGTGAA GGAC TTAAC C AAAAGGATAC GCAC GGTGCT 2800 
GAGATGCTGG TGGACCTCCA GTACAGCC TG GCCAAATCCT 2880 
CATGGCCAGG ATCCATGTCA AAAATGGCGA TCTCTCAGAG 2960 
AATATCTCAC ACGGAAAGGC GTGTTTAGAC AAGGATGCAC 3040 
TCCATGATGG AAGACGTGGG GATGCAGGAT GTCCATTTCA 3120 
TGGACTCTGG AAAGCCGAGC GCTACGAGCT CATCGCCGAC 3200 
ATTTCTTTGA AGATGAAGAT GGAAAGGAGT ATATTTACAA 3280 
CTCCTTAAAC TCTACTCGGA TAAATTTGGT TCTGAAAATG 3360 
TCTGGATTCT AAGTAT GCAT ACATCCAGGT GACTCACGTC 3440 
CAGAGTTTGA GAGATCCCAC AACATCCGCC GCTTCATGTT 3520 
GTGGAAGAGC AGTGCAAACG GCGCACCATC CTGACAGCCA 3600 
GTACCAGCAC CACACTGACC TGAAC CC CAT CGAGGTGGCC 3680 
TGTGCTCCTC GGCCGAGGTG GACATGATCA AACTGCAGCT 3760 
CCACTAGCAT ATGCGCGAGC TTTCTTAGAT GATACAAACA 3840 
AGTTTTCAGG CAATTTGTGG AAGCTTGCGG TCAAGCC TTA 3920 
ATCAGGAAGA AATGAAAGCC AACTACAGGG AAATGGCGAA 4000 
GAGGAGAAGA CGAGCGTCTT ACCGAATTCC CTTCACATCT 4080 
CGGGATGACC AGCTCGTCTT CGGTCGTGTG ATTACATCTC 4160 
CAGGATGCTT TCCAAAGCCA ATCACTGGGG AGACCGAGCA 4240 
AACAACGTTA TTTCTTAACA GAC TTTCTAT AGGAGTTGTA 4320 
CAAAGTTTTC ATTGTGTCTT AACAAAGGTG TGGTAGACAC 4400 
GTGTTAGAAT AGATGGCCTA CAGAAAAAAA AGGTTCTGGG 4480 
GGGGACCTTT TGCCTCGACT CGTGCCGGAA ATCTGATCGT 4560 
TTGTATGACT AGGATTTGTG CTATTATCTC ATTCAACAAC 4640 
TTAATCCGCT ACTGGCTTCA AGTCAGAACT TTGTCAT TAA 4720 
CATT TTTAAT ACTCACATGG GCTTATGCAT TAAGTTTAAT 4800 
TAATGGTTTA TTCTTGTCAT AAAAATGTGC AATATGGAGA 4880 
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TAACTGAGCT 
ATATATGTAT 
CCAGTATATG 


I 40 

TGGGTACCGG 
AAACCCAGAA 
TCCATGTCAG 
CTTCCCCTCC 
TGGCTACCAA 
TTTCCACTCA 
CTGGAGCCCA 
TTCTTGCCCA 
GGTCATTATT 
AGGCTGAGCC 
TCTGCCGATT 
GCATTTGATA 
TTGTAAATAT 
GCTGTCTTCA 
TGC TCCTGGA 
CGTTGAACTT 
GAGTTCTGCA 
TCTGATCGCC 
CAAGGATAGC 
CCCTTCCCTG 
GAAGGGAAGC 
CAACTCCAAA 
GAAAGGAATA 
TAAACTTGAC 
TTACATATTG 
TACATGGGGA 
TGTTTCCCGT 
ACAGCTATGG 
GCTCTGGACA 
AAAAGTTTTT 
GGTCCTTAAT 
AAGTGC TGTA 
TTACAC TGGA 
GCATTGGGGA 
AGCTTCTCCT 
GAACGACCCA 
GGCTCGACAG 
CTAGTGGCAG 
CGAGGAGGCC 
AGTGCGCAGA 
AAGCGGAGGG 
TTCTCAGAGA 
ACCCTAAGGA 
GAAAGGAAAA 
GCAGGGCGGG 
TCCCTGTCAT 
CTCCGGCAGC 
CAATGCTGGC 
TGC T TAAGGA 
CAGCTCGAGT 
CTGCCCCCTG 
CAATGG TTCA 
TTTGCAAACT 
GGAAATAAAG 
TGGCAATATT 
TTGCAGAGTA 
TTGATGCCTG 
TAGGAGTATG 
AGACACTCAA 
TACATTTCTA 
CAATACACTT 


I 50 
GCCCCCCCTC 
TTTTATGATG 
CTGTGACAAC 
TGAAAGACGG 
GAGCTTGGGA 
TCTGGTTTCT 
AGCC TTAGGA 
CTATC CTAAA 
CATGTGGTTG 
ATATGTTGCC 
TCCTCACCAG 
GAGAACTCCA 
GCTGATGCCA 
TCAAGAGATG 
GACCCAAAGA 


| 60 
GAGGTCGACG 
AGATTAAAAT 
TCAAGTAAAG 
AAGGGTGGTG 
TGGGCAGGCA 
ACAGGGATAC 
AACGAACTTG 
CCAGCTGTTC 
CCCAGTGCCA 
TCTGAATACA 
CAACAAACTA 
AAGTTAAGTT 
CACATCACTC 
TTTCACC TTC 
CCCTCTTTGA 


| 70 
GTATCGATAA 
AGAGTTGCCC 
GAAGCACGAA 
ACAAGCGAGC 
TTATGGTCCG 
TCAGGATCAG 
TAAAGT AC CT 
CGAGTCCTCA 
TGAGGAAGGA 
AGACAGTGCA 
CTGAGGTACT 
GCTGCGAAAC 
AGAAGTTTGG 
ATGGACAGGG 
ATACAAGTTT 


t 80 
GCTTGATATC 
ACTCAGC TGC 
GAAGAGGGAT 
AGCACATCCC 
GAAATTAAAT 
CATTTACATA 
TAAGAGTCTG 
CCAGAGCCAC 
TTGGAGAGCC 
TGAAGAACTG 
CATGGTTTTT 
CAGAGAT TTC 
AGATAATCCA 
GCTTTGTCTT 
GAATTTCTCC 


ACCAATGCCA 

GAAACCACTT 

ATCAGTGTGC 

CACCCTCTAC 

TGAACGCGGG 

ACCCTGGACA 

CATCAACAGT 

GTGAGAAGAG 

CAGTCTGAGA 

GAACAAGGCT 

AGCGATACAT 

AACAGAACAG 

CCACTCGGAC 

CGCTTTCTCT 

GATGTCTACC 

TTATAAGTTT 

ACTCCAAGCT 

AAGAAGTCCT 

AACCAGATTC 

CTGATGTGAA 

GAGATGCTGG 

CATGGCCAGG 

AATATC TCAC 

TCCATGATGG 

TGGACTCTGG 

ATTTCTTTGA 

CTCCTTAAAC 

TCTGGATTCT 

CAGAGTTTGA 

GTGGAAGAGC 

GTACCAGCAC 

TGTGCTCCTC 

CCACTAGCAT 

AGTTTTCAGG 

ATCAGGAAGA 

GAGGAGAAGA 

CGGGATGACC 

CAGGATGCTT 

AACAACGTTA 

CAAAGTTTTC 

GTGTTAGAAT 

GGGGACCTTT 

TTGTATGACT 

TTAATCCGCT 

CATTTTTAAT 

TAATGGTTTA 


TTTGGAAAAG 

CTTGGTGGGA 

TCAAGAACCT 

CTGCCTCTGT 

CATGACCGTG 

ACAGCCTGCA 

GTGAGAAATG 

CAATTCCCTG 

TTAAGAGCCT 

TCAACATCTG 

AGCCAGGAAC 

GAATGATGCA 

GCAGATGTTC 

ATTTACATTG 

TGTGTTTTCT 

CCCTCAACAT 

GAGCTC CATC 

TTGTCCGGAC 

CAGCAGTCCC 

GGACTTAACC 

TGGACCTCCA 

ATCCATGTCA 

ACGGAAAGGC 

AAGACGTGGG 

AAAGCC GAGC 

AGATGAAGAT 

TGTACTCGGA 

AAGTATGCAT 

GAGATCCCAC 

AGTGCAAACG 

CACACTGACC 

GGCCGAGGTG 

ATGCGCGAGC 

CAATTTGTGG 

AATGAAAGCC 

CGAGCGTCTT 

AGCTCGTCTT 

TCCAAAGCCA 

TTTCTTAACA 

ATTGTGTCTT 

AGATGGCCTA 

TGCCTCGACT 

AGGATT TGTG 

ACTGGCTTCA 

AC TCACATGG 

TTCTTGTCAT 


GCAGGATTCA 
CTGTTACTGA 
GCTGATAAAG 
TTGGTCTGCT 
AAGGATGAAT 
CAAGGACCTG 
CTGATTCGAG 
GATAAGCACC 
ACTGATGTGT 
AACTTATGGA 
CAGGAGGGGT 
TGC CAGATTG 
TGCACCAGTC 
GCGTTTAAGA 
TCAAAAACAT 
TCTATGAAGG 
AGGACGGAGG 
ACATTTGCAA 
TGTCCATCAT 
AAAAGGATAC 
GTACAGCCTG 
AAAATGGCGA 
GTGTTTAGAC 
GATGCAGGAT 
GCTACGAGCT 
GGAAAGGAGT 
TAAATTTGGT 
ACATCCAGGT 
AACATCCGCC 
GCGCACCATC 
TGAAC C C CAT 
GACATGATCA 
TTTCTTAGAT 
AAGCTTGCGG 
AACTACAGGG 
ACCGAATTCC 
CGGTCGTGTG 
ATCACTGGGG 
GACTTTCTAT 
AACAAAGGTG 
CAGAAAAAAA 
CGTGCCGGAA 
CTATTATCTC 
AGTCAGAACT 
GCT TATGCAT 
AAAAATGTGC 


AAGAT AC CAA 
GGGAGGTGGG 
CATTCTTTTG 
GATTGAAAAC 
CCCTGGCTCT 
CTGGGCGCCA 
AGGATCTCTC 
AACAAAGTAG 
TTCCTCTACA 
TTTTTTTACA 
TGGGACCCAT 
CAGCAGCTGG 
ATTACTT GAA 
ACCAGCTCCT 
CAGTC TGAAA 
GAGAGCGGAC 
CCTCCCAGCT 
GT CAT CATAT 
CAACAAC TGT 
GCACGGTGCT 
GCCAAATCCT 
TCTCTCAGAG 
AAGGATGCAC 
GTCCATTTCA 
CATCGCCGAC 
ATATTTACAA 
TCTGAAAATG 
GACTCAC GTC 
GCTTCATGTT 
CTGACAGCCA 
CGAGGTGGCC 
AACTGCAGC T 
GATACAAACA 
TCAAGCCTTA 
AAATGGCGAA 
CTTCACATCT 
ATTACATCTC 
AGAC CGAGCA 
AGGAGTTGTA 
TGGTAGACAC 
AGGTTCTGGG 
ATCTGATCGT 
ATTCAACAAC 
TTGTCATTAA 
TAAGTTTAAT 
AATATGGAGA 
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FIG.10F 


NSDCCID; <WO 0231 1 17A2_I_> 


WO 02/31117 


81/114 


4 

PCT/USO 1/32202 


1 MEGHVMIAFL 

81 SMTTILKPSA 

161 SKNAHHSLAV 

241 QLDYSLTDEF 

321 RXNVRDVSPF 

401 TDSGHSLPER 

481 EVCLHQFQYM 

561 IATEVCLTAL 

641 AALCYEILKC 

721 SDRLIKHTSF 

801 MCYVHVTALV 

881 KLIIPIYEKR 

961 FFDEKEIjQER 

1041 EMSKKVAELR 

1121 NERLIKEDQL 
I 10 


| 20 
PTll^QIiFRV 
DFLTSMKT.T.R 
FIKRCFTFMD 
CBNHEXVGLL 
PVNAGMTVKD 
NSEKSNSLDK 
GKRYIARNQE 
DTIiSLFTLAF 
CNSKLSSIRT 
SSDVKDLTKR 
AEYLTRKGVF 
RDFFEDEDGK 
KTEFERSHNI 
QLC S S AEVDM 
EYQEEMKANY 

1 20 


| 30 
LTBATQEEVA 
YSWFFFDVLI 
RGFVFKQINN 
LREVGTALQE 
ESIALPAVNP 
HQQSSTLGNS 
GLGPXVHDRK 
KNQLLADHGH 
EASQLLYFLM 
IRTVIWATAQ 
RQGCTAFRVI 
EYIYKEPKLT 
RRFMFEMPFT 
IKLQUCLQGS 
REMAKELSEI 

1 30 


f 40 
VNVTRVIIHV 
KSMAQHLIEN 
YISCFAPGDP 
FREVRLIAIS 
LVTPQKGSTIi 
WRCDKLDQS 
SQTLPVSRNR 
KPIiMKKVFDV 
RNNFDYTGKK 
MKEHEKDPEM 
TPNIDEEASM 
PLSEISQRIi 
QTGKRQGGVE 
VSVQVNAGPL 
MHEQICPUEE 

I 40 


I 50 
VAQCHEEGLE 
SKVKLLRNQR 
KTLFEYKFEF 
VLKNIXIKHS 
DNSLHKDLLG 
EIKSLXMCFL 
TGMMHAKLQQ 
YLCFLQKHQS 
SFVRTHLQVI 
LVDIiQY SIiAK 
MEDVGMQDVH 
KLYSDKFGSE 
EQCKKRTILT 
AYARAFXiDDT 
KTSVX1PNSI1H 

I 50 


| 60 
SHLRSYVKYA 
FPASYHHAAE 
LRWCNHEHY 
FDDRYASRS H 
AISGIASPYT 
Y ILKSMSDD A 
LGSLDNSLTF 
ETALKNVFTA 
ISVSQLIADV 
S YAS TPEliRK 
FNEDVXMELL 
KVKMIQDSGK 
AIHCFPYVKK 
NTKRYPDNKV 
IFKAISGTPT 

I 60 


| 70 
YKAEPYVASE 
TWNMLMPHI 
IPLNLPMPFG 
QARIATLYU? 
TSTPNINSVR 
LFTYWNKAST 
NHSYGHSDAD 
IASLIYKFPS 
VGIGETRFQQ 
TWXDSMARIR 
EQCADGItWKA 
VNPKDUDSKY 
RIPVMYQHHT 
KT.T.KEVFRQF 
STMVHGMTSS 

| 70 


I 80 
YKTVHEEI*TK 80 
TQKFGDNPEA 160 
KGRIQRYQDL 240 
LFGLLIENVQ 320 
NADSRGSLIS 400 
SELMDFFTIS 480 
VLHQSLLEAN 560 
TFYEGRADMC 640 
SLSIINNCAN 720 
VKNGDLSEAA 800 
ERYELIADIY 880 
AYIQVTHVIP 960 
DLNPIEVAID 1040 
VEACGQA1AV 1120 
SSW 1195 

t 80 


FIG. 10F (cont.) 


NSDOCID: <WO 0231117A2 I > 


WO 02/31117 
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82/114 


I 10 

1 AATTGTAATA 
81 GAATTCGGCA 
161 ATGAAAAGCA 
241 GTCGTTGAAA 
321 GGTCTCGGCG 
401 GGGTAGATGG 
481 ATTTTTTCCA 
561 CATGCGATGG 
641 ACAGGAAGAA 
721 ACTTGAGGTC 
801 ACGAAATCCA 
881 CTTTGATGTA 
961 CTGCATCCTA 
1041 GAGGCATCTA 
1121 CAAGCAGATC 
1201 GTGTAGTGTG 
12.81 .GACCTCCAGC 
1361 GACAGCCCTC 
1441 ATGACAGATA 
1521 GTCCAGCGGA 
1601 ACCAGCTGTG 
1681 TCTCCGGCAT 
1761 ATAAGCACAG 
1841 CACATTGGGA 
1921 TCTTAAAGAG 
2001 ATATCTGAAG 
2081 AGTTCATGAT 
2161 GCAGCCTGGA 
2241 GCCAACATTG 
2321 GGCCGACCAT 
2401 CGGCTTTAAA 
2481 ATGTGTGCGG 
2561 GCTCTACTTC 
2641 CTGTCAGCCA 
2721 GCCAACAGTG 
2801 AATGGCCACC 
2881 ATGCCAGCAC 
2961 GCAGCAATGT 
3041 CGCCTTCAGG 
3121 ACGAGGATGT 
3201 ATCTACAAAC 
3281 GGAACCCAAA 
3361 TCAAAATGAT 
3441 ATCCCCTTCT 
3521 TGAGATGCCA 
3601 TACACTGCTT 
3681 ATT GAC GAGA 
3761 CAAACTCCAG 
3841 CAAAGCGATA 
3921 GCGGTAAACG 
4001 GGAGCTTTCT 
4081 TCAACGCCAT 
4161 ATGGCCCGTG 
4241 CAGGGAGGAC 
4321 AGAAGGTGCA 
4401 TCTTGAGCTG 
4481 ATCTACATGG 
4561 AATCAGGGTA 
4 641 ATAGAGCAAG 
4721 TCATCGACTC 
4801 TGTGATAAAT 
4881 TGTATACAAG 


| 20 
CGACTCACTA 
CGAGTTTTAC 
CCACCTGTTG 
CCCAAGTTGG 
AACCTTCCTT 
AGGCAAGCCA 
GTACTGTCAG 
AAGGCCACGT 
GTCGCGGTTA 
ATATGTTAAG 
TGACCACGAT 
CTGATCAAAT 
TCATCATGCA 
AGAACGCGAA 
AACAACTACA 
CAACCATGAA 
TT.GACTACTC 
CAGGAGTTCC 
TGCTTCAAGG 
TCAATGTGAG 
AATCCGCTGG 
TGCTTCTCCA 
ATTCGGGTAA 
AATTCCGTGG 
CATGTCTGAT 
TCTGCCTGCA 
CGAAAGTCTC 
TAACTCTCTC 
CTACTGAGGT 
GGACATAATC 
AAATGTCTTC 
CTCTGTGTTA 
CTGATGAGGA 
GCTGATAGCA 
ACCGGCTTAT 
GCCCAGATGA 
GCCCGAGCTC 
GCTATGTCCA 
GTCATTACCC 
GCTGATGGAG 
TTATCATCCC 
CTCACACCGC 
ACAGGATTCT 
TTGACGAAAA 
TTTACGCAGA 
CCCTTATGTG 
TGAGTAAGAA 
GGCAGCGTGA 
TCCTGACAAT 
AACGTCTGAT 
GAAATCATGC 
CAGTGGGACT 
TGTGGGGACT 
CAAGGGGAAG 
CATATTTTTT 
GACTTAGATT 
CAGGGAGGGC 
CAGAAC TT AC 
AATAGTGAGC 
CGGGACGGTC 
TTGTGCTGGT 
TCTTTACT 

i 20 


| 30 
TAGGGCGAAT 
ACCATCACCA 
CTCACATTCT 
CTACTCC TGG 
CGGGCTATCT 
CTGCTGAAAA 
AAAACCGAAT 
GATGATCGCC 
ACGTGACTCG 
TACGCGTATA 
TCTCAAGCCT 
CTATGGCTCA 
GCGGAAACCG 
TCATAGCCTT 
TTAGCTGTTT 
CATTATATTC 
ATTAACAGAT 
GGGAGGTCCG 
AGCCATCAGG 
GGATGTGTCA 
TGACGCCGCA 
TATACAACCT 
CAGCCTTCCA 
TTCGCTGTGA 
GATGCTTTGT 
CCAGTTCCAG 
AGACATTGCC 
ACTTTTAACC 
TTGCCTGACA 
CTCTCATGAA 
ACTGCCTTAA 
CGAGATTCTC 
ACAACTTTGA 
GACGTTGTTG 
TAAGCACACC 
AGGAGCATGA 
AGGAAGACGT 
CGTAACAGCC 
GAAACATCGA 
CTCCTTGAGC 
CATTTATGAG 
TGTCGGAAAT 
GGCAAGGTCA 
AGAGTTGCAA 
CCGGGAAGAG 
AAGAAGCGCA 
GGTGGCGGAG 
GTGTTCAGGT 
AAAGTGAAGC 
TAAAGAAGAC 
ATGAGCAGAT 
CCAACAAGCA 
TGCTTTGTCA 
GGGAGAGAAA 
TAAATCTCAC 
TTATTCTTCC 
TGCACTGACA 
TAGTTTTGTC 
TAACTGAGCT 
ATATATGTAT 
CCAGTATATG 


I 40 
TGGGTACCGG 
AAACCCAGAA 
TCCATGTGAG 
CTTCCCCTCC 
TGGCTACCAA 
TTTCCACTCA 
CTGGAGCCCA 
TTCTTGCCCA 
GGTCATTATT 
AGGCTGAGCC 
TCTGCCGATT 
GCATTTGATA 
TTGTAAATAT 
GCTGTCTTCA 
TGCTCCTGGA 
CGTTGAACTT 
GAGTTC TGCA 
TCTGATCGCC 
CAAGGATAGC 
CCCTTCCCTG 
GAAGGGAAGC 
CAACTCCAAA 
GAAAGGAATA 
TAAACT TGAC 
TTACATATTG 
TACATGGGGA 
TGTTTCCCGT 
ACAGCTATGG 
GCTCTGGACA 
AAAAGTTTTT 
GGTCCTTAAT 
AAGTGCTGTA 
TTACACTGGA 
GCATTGGGGA 
AGCTTCTCCT 
GAACGACCCA 
GGC TCGACAG 
CTAGTGGCAG 
CGAGGAGGCC 
AGTGCGCAGA 
AAGCGGAGGG 
TTCTCAGAGA 
ACCCTAAGGA 
GAAAGGAAAA 
GCAGGGCGGG 
TCCCTGTCAT 
CTCCGGCAGC 
CAATGCTGGC 
TGCTTAAGGA 
CAGCTCGAGT 
CTGCCCCCTG 
CAATGGTTCA 
TTTGCAAACT 
GGAAATAAAG 
TGGCAATATT 
TTGCAGAGTA 
TTGATGCCTG 
TAGGAGTATG 
AGACACTCAA 
TACATTTCTA 
CAATACAC TT 


30 


I 


40 


I 50 | 60 | 70 I 80 

GCCCCCCCTC GAGGTCGACG GTATCGATAA GCTTGATATC 80 
TTTTATGATG AGATTAAAAT AGAGTTGCCC ACTCAGCTGC 160 
CTGTGACAAC TCAAGTAAAG GAAGCACGAA GAAGAGGGAT 240 
TGAAAGACGG AAGGGTGGTG ACAAGCGAGC AGCACATCCC 320 
GAGCTTGGGA TGGGCAGGCA TTATGGTCCG GAAATTAAAT 400 
TCTGGTTTCT ACAGGGATAC TCAGGATCAG CATT TACATA 480 
AGCCTTAGGA AACGAACTTG TAAAGTACCT TAAGAGTCTG 560 
CTATC CTAAA CCAGCTGTTC CGAGTCCTCA CCAGAGCCAC 640 
CATGTGGTTG CCCAGTGCCA TGAGGAAGGA TTGGAGAGCC 720 
ATATGTTGCC TCTGAATACA AGACAGTGCA TGAAGAACTG 800 
TCCTCACCAG CAACAAACTA CTGAGG TACT CATGGTTTTT 880 
GAGAACTCCA AAGTTAAGTT GCTGCGAAAC CAGAGATTTC 960 
GCTGATGCCA CACATCACTC AGAAGTTTGG AGATAATCCA 1040 
TCAAGAGATG TTTCACCTTC ATGGACAGGG GCTTTGTCTT 1120 
GACCCAAAGA CCC TCTTTGA ATACAAGTTT GAATTTCTCC 1200 
AC CAATGC CA TTTGGAAAAG GCAGGATTCA AAGATACCAA 1280 
GAAACCACTT CTTGGTGGGA CTGTTACTGA GGGAGGTGGG 1360 
ATCAGTGTGC TCAAGAACCT GCTGATAAAG CATTCTTTTG 1440 
CACCCTCTAC CTGCCTCTGT TTGGTCTGCT GATTGAAAAC 1520 
TGAACGCGGG CATGACCGTG AAGGATGAAT CCCTGGCTCT 1600 
ACCC TGGACA ACAGCC TGCA CAAGGACCTG CTGGGCGCCA 1680 
CATCAACAGT GTGAGAAATG CTGATTCGAG AGGATCTCTC 1760 
GTGAGAAGAG CAATTCCCTG GATAAGCACC AACAAAGTAG 1840 
CAGTCTGAGA TTAAGAGCCT AC TGATGTGT TTCCTCTACA 1920 
GAACAAGGCT TCAACATCTG AAC TTATGGA TTTTTTTACA 2000 
AGCGATACAT AGCCAGGAAC CAGGAGGGGT TGGGACCCAT 2080 
AACAGAACAG GAATGATGCA TGCCAGATTG CAGCAGCTGG 2160 
CCACTCGGAC GCAGATGTTC TGCACCAGTC ATTACTTGAA 2240 
CGCTTTCTCT ATTTACATTG GCGTTTAAGA ACCAGCTCCT 2320 
GATGTCTACC TGTGTTTTCT TCAAAAACAT CAGTC TGAAA 2400 
TTATAAGTTT CCCTCAACAT TCTATGAAGG GAGAGCGGAC 2480 
ACTCCAAGCT GAGCTCCATC AGGACGGAGG CCTCCCAGCT 2560 
AAGAAGTCCT TTGTCCGGAC ACATTTGCAA GTCATCATAT 2 640 
AACCAGATTC CAGCAGTCCC TGTCCATCAT CAACAACTGT 2720 
CTGATGTGAA GGACTTAACC AAAAGGATAC GCACGGTGCT 2800 
GAGATGCTGG TGGACCTCCA GTACAGCCTG GCCAAATCCT 2880 
CATGGCCAGG ATCCATGTCA AAAATGGCGA TCTCTCAGAG 2960 
AATATCTCAC ACGGAAAGGC GTGTTTAGAC AAGGATGCAC 3040 
TCCATGATGG AAGACGTGGG GATGCAGGAT GTCCATTTCA 3120 
TGGACTCTGG AAAGCCGAGC GCTACGAGCT CATCGCCGAC 3200 
ATTTCTTTGA AGATGAAGAT GGAAAGGAGT ATATTTACAA 3280 
CTCCTTAAAC TGTACTCGGA TAAATTTGGT TCTGAAAATG 3360 
TCTGGATTCT AAGTATGCAT ACATCCAGGT GACTCACGTC 3440 
CAGAGTTTGA GAGATC CCAC AACATCCGCC GCTTCATGTT 3520 
GTGGAAGAGC AGTGCAAACG GCGCACCATC CTGACAGCCA 3600 
GTACCAGCAC CACACTGACC TGAACCCCAT CGAGGTGGCC 3680 
TGTGCTCCTC GGCCGAGGTG GACAT GATCA AACTGCAGCT 3760 
CCACTAGCAT ATGCGCGAGC TTTCTTAGAT GATACAAACA 3840 
AGTTTTCAGG CAATTTGTGG AAGCTTGCGG TCAAGCCTTA 3920 
ATCAGGAAGA AATGAAAGCC AACTACAGGG AAATGGCGAA 4000 
GAGGAGAAGA CGAGCGTCTT ACCGAATTCC CTTCACATCT 4080 
CGGGATGACC AGCTCGTCTT CGGTCGTGTG AT TACATCTC 4160 
CAGGATGCTT TCCAAAGCCA ATCACTGGGG AGACC GAGCA 4240 
AACAACGTTA TTTCTTAACA GAC TTTCTAT AGGAGTTGTA 4320 
CAAAGTTTTC ATTGTGTC TT AACAAAGGTG TGGTAGACAC 4400 
GTGTTAGAAT AGATGGCCTA CAGAAAAAAA AGGTTCTGGG 4480 
GGGGACCTTT TGCCTCGACT CGTGCCGGAA ATCTGATCGT 4560 
TTGTATGACT AGGATT TGTG CTATTATCTC ATTCAACAAC 4 640 
TTAATCCGCT ACTGGC TTCA AGTCAGAACT TTGTCAT TAA 4720 
CATTTTTAAT ACTCACATGG GCTTATGCAT TAAGTTTAAT 4800 
TAATGGTTTA TTCTTGTCAT AAAAATGTGC AATATGGAGA 4880 

4898 
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I 10 | 20 I 30 | 40 

1 MEGHVMIAFL PTILNQLFRV LTRATQEEVA VNVTRVTIHV 
81 SMTTILKPSA DFXTSNKLLR YSWFFFDVLI KSMAQHLIEN 
161 SKNANHSLAV FIKRCFTFMD RGFVFKQINN YISCFAPGDP 
241 QLDYSLTDEF CRNHFLVGIiL LREVGTALQE FREVKLIAXS 
321 RINVRDVSPF PVNAGMTVKD ESLALPAVNP LVTPQKGSTL 
401 TDSGNSIiPER NSEKSNSLDK HQQSSTLGNS WRCDKLDQS 
481 EVCLHQFQYM GKRYIARNQE GLGPIVHDRK SQTLPVSRNR 
561 IATEVCLTAL DTLSLFTLAF KNQLIADHGH NPLMKKVFDV 
641 AALC YE IL.KC CNSKLSSIRT EASQLLYFLM RNNFDYTGKK 
721 SDRLIKHTSF SSDVKDLTKR IRTVLMATAQ MKEHEKDPEM 
801 MCYVHVTALV AEYXTRKGVF RQGCTAFRVI TPNIDEEASM 
881 KLIIPIYEKR RDFFEDEDGK EYIYKEPKLT PLSEISQRLL 
961 FFDEKELQER KTEFERSHNI RRFMFEMPFT QTGKRQGGVE 
1041 EMSKKVAE1R QLC S S AEVDM IKLQLKLQGS VSVQVNAGPL 
1121 NEKLIKEDQXi EYQEEMKANY REMAKE1SEI MHEQICPIiEE 
| 10 | 20 | 30 | 40 


| 50 I 60 | 70 | 80 

VAQCHEEGLE SELRSYVKYA YKAEPYVASE YKWHEELTK 80 
SKVKLLRNQR FPASYHHAAE TWNMLMPHI TQKFGDNPEA 160 
KTLFEYKFEF LRWCNHEHY IPLNLPMPFG KGRIQRYQDIi 240 
VLKNLLXKHS FDDRYASRSH QARIATLYLP LFGLLIENVQ 320 
DNSLHKDLLG AISGIASPYT TSTPKINSVR NADSRGSLIS 400 
EIKSKLMCFX. YILKSMSDDA UTTYWNKAST SELMDFFTIS 480 
TGMMHAKLQQ LGSU5NSLTF NHSYGHSDAD VLHQSliKEAN 560 
YLCFLQKHQS ETALKNVFTA I*RSLIYKFPS TFYEGRADMC 640 
SFVRTKLQVT ISVSQLIADV VGIGETRFQQ SLSIJNNCAN 720 
LVDLQYSLAK SYASTPEUUC TWLDSMARIH VKNGDLSEAA 800 
MEDVGMQDVH FNEDVLMELX EQCADGLWKA ERYELIADIY 880 
KLYSDKFGSE NVKMIQDSGK VNPKDLDSKY AYIQVTHVTP 960 
EQCKRRTILT AIHCFPYVKK RIPVMYQHHT DLNPIEVAID 1040 
AYARAFLDDT NTKRYPDNKV KULKEVFRQF VEACGQALAV 1120 
KTSVLPNSLH IFNAISGTPT STMVHGMTSS SSW 1195 
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1 AATTGTAATA CGACTCACTA TAGGGCGAAT TGGGTACCGG 
81 GAATTCGGCA CGAGTTTTAC ACCATCACCA AAACCCAGAA 
161 ATGAAAAGCA CCACCTGTTG CTCACATTCT TCCATGTCAG 
241 GTCGTTGAAA CCCAAGTTGG CTACTCCTGG CTTCCCCTCC 
321 GGTCTCGGCG AACCTTCCTT CGGGCTATCT TGGCTACCAA 
401 GGGTAGATGG AGGCAAGCCA CTGCTGAAAA TTTCCACTCA 
481 ATTTTTTCCA GTACTGTCAG AAAACCGAAT CTGGAGCCCA 
561 CATGCGATGG AAGGCCACGT GATGATCGCC TTCTTGCCCA 
641 ACAGGAAGAA GTCGCGGTTA ACGTGACTCG GGTCATTATT 
721 ACTTGAGGTC ATATGT TAAG TACGCGTATA AGGCTGAGCC 
801 ACCAAATCCA TGACCACGAT TCTCAAGCC T TCTGCCGATT 
881 CTTTGATGTA CTGATCAAAT CTATGGCTCA GCATTTGATA 
961 CTGCATCCTA TCATCATGCA GCGGAAACCG TTGTAAATAT 
1041 GAGGCATCTA AGAACGCGAA TCATAGCCTT GCTGTCTTCA 
1121 CAAGCAGATC AACAACTACA TTAGCTGTTT TGCTCCTGGA 
1201 GTGTAfiTGTG CAACCATGAA CATtAtAxTC CGTTGAACtx 
1281 GACCTCCAGC TTGACTACTC ATTAACAGAT GAGTTCTGCA 
1361 GACAGCCCTC CAGGAGTTCC GGGAGGTCCG TCTGATCGCC 
1441 ATGACAGATA TGCTTCAAGG AGCCATCAGG CAAGGATAGC 
1521 GTCCAGCGGA TCAATGTGAG GGATGTGTCA CCCTTCCCTG 
1601 ACCAGCTGTG AATCCGCTGG TGACGCCGCA GAAGGGAAGC 
1681 TCTCCGGCAT TGCTTCTCCA TATACAACCT CAACTCCAAA 
1761 ATAAGCACAG ATTCGGGTAA CAGCCTTCCA GAAAGGAATA 
1841 CACATTGGGA AATTCCGTGG TTCGCTGTGA TAAACT TGAC 
1921 TCTTAAAGAG CATGTC TGAT GATGC TTTGT TTACATATfTG 
2001 ATATCTGAAG TCTGCCTGCA CCAGTTCCAG TACATGGGGA 
2081 AGTTCATGAT CGAAAGTCTC AGACATTGCC TGTTTCCCGT 
2161 GCAGCC TGGA TAACTCTCTC AC TTTTAACC ACAGCTATGG 
2241 GCCAACATTG CTACTGAGGT TTGCCTGACA GCTCTGGACA 
2321 GGCCGACCAT GGACATAATC CTCTCATGAA AAAAGTTTTT 
2401 CGGCTTTAAA AAATGTCTTC ACTGCCTTAA GGTCCTTAAT 
2481 ATGTGTGCGG CTCTGTGTTA CGAGATTCTC AAGTGC TGTA 
2561 GCTCTACTTC CTGATGAGGA ACAACTTTGA TTACACTGGA 
2641 CTGTCAGCCA GCTGATAGCA GACGTTGTTG GCATTGGGGA 
2721 GCCAACAGTG ACCGGCTTAT TAAGCACACC AGCTTCTCCT 
2801 AATGGCCACC GCCCAGATGA AGGAGCATGA GAACGACCCA 
2881 ATGCCAGCAC GCCCGAGCTC AGGAAGACGT GGCTCGACAG 
2961 GCAGCAATGT GCTATGTCCA C GTAACAGC C CTAGTGGCAG 
3041 CGCCTTCAGG GTCATTACCC CAAACATCGA CGAGGAGGCC 
3121 ACGAGGATGT GCTGATGGAG CTCCTTGAGC AGTGCGCAGA 
3201 ATC T ACAAAC TTATCATCCC CATT TATGAG* AAGCGGAGGG 
3281 GGAACCCAAA CTCACACCGC TGTCGGAAAT TTCTCAGAGA 
3361 TCAAAATGAT ACAGGATTCT GGCAAGGTCA ACCCTAAGGA 
3441 ATCCCCTTCT TTGACGAAAA AGAGTTGCAA GAAAGGAAAA 
3521 TGAGATGCCA TTTACGCAGA CCGGGAAGAG GCAGGGCGGG 
3601 TACACTGCTT CCCTTATGTG AAGAAGCGCA TCCCTGTCAT 
3681 AT TGAC GAGA TGAGTAAGAA GGTGGCGGAG CTCCGGCAGC 
3761 CAAACTCCAG GGCAGCGTGA GTGTTCAGGT CAATGCTGGC 
3841 CAAAGCGATA TCC TGACAAT AAAGTGAAGC TGCTTAAGGA 
3921 GCGGTAAACG AACGTCTGAT TAAAGAAGAC CAGCTCGAGT 
4001 GGAGCTTTCT GAAATCATGC ATGAGCAGAT CTGCCCCCTG 
4081 TCAACGCCAT CAGTGGGACT CCAACAAGCA CAATGGTTCA 
4161 ATGGCCCGTG TGTGGGGACT TGCTTTGTCA TTTGCAAACT 
4241 CAGGGAGGAC CAAGGGGAAG GGGAGAGAAA GGAAATAAAG 
4321 AGAAGGTGCA CAT AT T TT TT TAAATCTCAC TGGCAATATT 
4401 TCTTGAGCTG GAC TTAGATT TTATTCTTCC TTGCAGAGTA 
4481 ATCTACATGG CAGGGAGGGC TGCAC TGAGA TTGATGCCTG 
4561 AATCAGGGTA CAGAACTTAC TAGTTTTGTC TAGGAGTATG 
4 641 ATAGAGCAAG AATAGTGAGC TAACTGAGCT AGACACTCAA 
4721 TCATCGACTC CGGGACGGTC ATATATGTAT TACATTTCTA 
4801 TGTGATAAAT TTGTGCTGGT CCAGTATATG CAATACACTT 
4881 TGTATACAAG TCTTTACT 
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GCCCCCCCTC GAGGTCGACG GTATC GATAA GCTTGATATC 80 
TTTTATGATG AGATTAAAAT AGAGTTGCCC AC TCAGC TGC 160 
CTGTGACAAC TCAAGTAAAG GAAGCACGAA GAAGAGGGAT 240 
TGAAAGACGG AAGGGTGGTG ACAAG C GAGC AGCACATCCC 320 
GAGCTTGGGA TGGGCAGGCA TTATGGTCCG GAAATTAAAT 400 
TCTGGTTTCT ACAGGGATAC TCAGGATCAG CATTTACATA 480 
AGCCTTAGGA AACGAACTTG TAAAGTACCT TAAGAGTCTG 560 
CTATCCTAAA CCAGCTGTTC CGAGTCCTCA CCAGAGCCAC 640 
CATGTGGTTG CCCAGTGCCA TGAGGAAGGA TTGGAGAGCC 720 
ATATGTTGCC TCTGAATACA AGACAGTGCA TGAAGAACTG 800 
TCCTCACCAG CAACAAACTA CTGAGGTACT CATGGTTTTT 880 
GAGAACTCCA AAGTTAAGTT GCTGCGAAAC CAGAGAT TTC 960 
GCTGATGCCA CACATCACTC AGAAGTTTGG AGATAATCCA 1040 
TCAAGAGATG TTTCACCTTC ATGGACAGGG GCTTTGTCTT 1120 
GACCCAAAGA CCCTCTTTGA ATACAAGTTT GAATTTCTCC 1200 
ACCAATGCCA xTTGGAAAAG GCAGGAxxCA AAGAT AC CAA 1260 
GAAACCACTT CTTGGTGGGA CTGTTACTGA GGGAGGTGGG 1360 
ATCAGTGTGC TCAAGAACCT GCTGATAAAG CATT C TT T TG 1440 
CACCCTCTAC CTGCCTCTGT TTGGTCTGCT GATTGAAAAC 1520 
TGAAC GC GGG CATGACCGTG AAGGATGAAT CCCTGGCTCT 1600 
ACCCTGGACA ACAGCCTGCA CAAGGACCTG CTGGGCGCCA 1680 
CATCAACAGT GTGAGAAATG CTGATTCGAG AGGATCTCTC 1760 
GTGAGAAGAG CAATTCCCTG GATAAGCACC AACAAAGTAG 1840 
CAGTC TGAGA TTAAGAGCC T ACTGATGTGT TTCCTCTACA 1920 
GAACAAGGCT TCAACATCTG AACTTATGGA TTTTTTTACA 2000 
AGCGATACAT AGCCAGGAAC CAGGAGGGGT TGGGACCCAT 2080 
AACAGAACAG GAATGATGCA TGCCAGATTG CAGCAGCTGG 2160 
CCACTCGGAC GCAGATGTTC TGCACCAGTC AT TAC TTGAA 2240 
CGCTTTCTCT ATT TACATT G GCGTTTAAGA ACCAGCTCCT 2320 
GATGTCTACC TGTGTTTTCT TCAAAAACAT CAGTCTGAAA 2400 
TTATAAGTTT CCCTCAACAT TCTATGAAGG GAGAGCGGAC 2480 
ACTCCAAGCT GAGC TC CATC AGGAC GGAGG CCTCCCAGCT 2560 
AAGAAGTCCT TTGTCCGGAC ACATTTGCAA G T CAT CATAT 2640 
AAC CAGATT C CAGCAGTCCC TGTCCATCAT CAACAAC TGT 2720 
CTGATGTGAA GGACTTAACC AAAAGGATAC GCACGGTGCT 2800 
GAGAT GC TG G TGGACCTCCA GTACAGCCTG GC CAAATCCT 2880 
CATGGCCAGG ATCCATGTCA AAAATGGCGA TCTCTCAGAG 2960 
AATATCTCAC ACGGAAAGGC GTGTTTAGAC AAGGATGCAC 3040 
TCCATGATGG AAGACGTGGG GATGCAGGAT GTCCATT TCA 3120 
TGGACTCTGG AAAGCCGAGC GCTACGAGCT CATCGCCGAC 3200 
ATTTCTTTGA AGATGAAGAT GGAAAGGAGT ATATTTACAA 3280 
CTCCTTAAAC TGTACTCGGA TAAATTTGGT TCTGAAAATG 3360 
TCTGGATTCT AAGTATGCAT ACATCCAGGT GACTCACGTC 3440 
CAGAGTTTGA GAGATCCCAC AACATCCGCC GC TTCATGT T 3520 
GTGGAAGAGC AGTGCAAACG GCGCACCATC CTGACAGCCA 3600 
GT AC CAGCAC CACACTGACC TGAAC C C CAT CGAGGTGGCC 3680 
TGTGC TCCTC GGCCGAGGTG GACAT GAT CA AACTGCAGCT 3760 
CCACTAGCAT ATGCGCGAGC TTTCTTAGAT GATACAAACA 3840 
AGTTTTCAGG CAATTTGTGG AAGCTTGCGG TCAAGCCTTA 3920 
ATCAGGAAGA AATGAAAGCC AACTACAGGG AAATGGCGAA 4000 
GAGGAGAAGA CGAGCGTCTT ACCGAATTCC CTTCACATCT 4080 
CGGGATGACC AGCTCGTCTT CGGTCGTGTG ATTACATCTC 4160 
CAGGATGCTT TCCAAAGCCA ATCAC TGGGG AGACC GAGCA 4240 
AACAACGTTA TTTCTTAACA GACTTTCTAT AGGAGTTGTA 4320 
CAAAGTTTTC ATTGTGTCTT AACAAAGGTG TGGTAGACAC 4400 
GTGTTAGAAT AGATGGCCTA CAGAAAAAAA AGGTTCTGGG 4480 
GGGGACCTTT TGCCTCGACT CGTGC CGGAA ATCTGATCGT 4560 
TTGTATGACT AGGATTTGTG CTATTATCTC ATTCAACAAC 4 640 
TxAATCCGCT ACTGGCTTCA AGTCAGAACT TTGTCATTAA 4720 
CATTTTTAAT ACTCACATGG GCTTATGCAT TAAGTTTAAT 4800 
TAATGGTTTA TTCTTGTCAT AAAAATGTGC AATATGGAGA 4880 

4898 
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I 10 

1 MEGHVMIAFL 
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161 SKNANHSXiAV 
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321 RINVRDVSPF 
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481 EVCLHQFQYM 
561 IATEVCLTAL 
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721 SDRLIKHTSF 
801 MCYVHVTALV 
881 KLIIPIYEKR 
961 FFDEKELQER 
1041 EMSKKVAELR 
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I 10 


| 20 
PTILNQLFRV 
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SLSXINNCAN 720 
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Exon 1A (-182 to -102) 

GCAGGGGAAAAACCTGGCCCCATGATTCACTTACTTCCCACCGGATCTCTCCCATGACACGTGAGGATTA. 
TTACAATTTAA -102 

Exon IB (-219 to -102) 

TTATCCCTTTACTACTTGCGAAGTGAGTTCGGTAGATGGGAGTGGAGAAGAGAACCTTAGAATCATTGTrTAGTCTTCAT 
CTTTCACAGCTCAGGCTGAAGGCCTTTCCTTGCTGAGA -102 

Exon 1C (-143 to -102) 

GCGGCAGAGCGTGTCTGAGGTGGTGCGCGGGTCCGTGCTCCT -102 


Exon2 and the rest of human CLASP 2 cDNA 

-101 -79 

GGCAAAGCCAAAGCTAATTGAGC 

-78 ~ 1 
AAGCTAATTGAGCCAC* C r»ACT ATGAAAATGTCATCGTCCAGAAGAAGACTCAGATCCTGAACGACTGTTTACGGGAG 

1/1 

ATG CTG CTC T7C CCT TAC GAT GAC TTT CAG ACG GCC ATC CTG AGA CGA CAG GGT CGA TAC 
Met leu leu phe pro tyr asp asp phe gin thr ala ile leu arg arg gin gly arg tyr 

61/21 91/31 

ATA TGC TCA ACA GT3 CCT GCC AAG GCG GAA GAG GAA GCA CAG AGC TTG TTT GTT ACA GAG 
ile cys ser thr val pro ala lys ala glu glu glu ala gin ser leu phe val thr glu 

121/41 151/51 

TGC ATC AAA ACC TAT AAC TCT GAC TGG CAT CTT GTG AAC TAT AAA TAT GAA GAT TAC TCA 
cys ile lys thr tyr asn ser asp trp his leu val asn tyr lys tyr glu asp tyr ser 

181/61 211/71 

GGA GAG TTT CGA CAG CTT CCG AAC AAA GTG GTC AAG TTG GAT AAA CTT CCA GTT CAT GTC 
gly glu phe arg glr. leu pro asn lys val val lys. leu asp lys- leu pro val his val 

241/81 271/91 

TAT GAA GTT GAC GAG GAG GTC GAC AAA GAT GAG GAT GCT GCC TCC CTT GGT TCC CAG AAG 
tyr glu val asp glu glu val asp lys asp glu asp ala ala ser leu gly ser gin lys 

301/101 331/111 

-GGT GGG ATC ACC AAG CAT GGC TGG CTG TAC AAA GGC AAC ATG AAC AGT GCC ATC AGC GTG 
gly gly ile thr lys his gly trp leu tyr lys gly asn met asn ser ala ile ser val 

361/121 391/131 

ACC ATG AGG TCA TTT AAG AGA CGA TTT TTC CAC CTG ATT CAA CTT GGC GAT GGA TCC TAT 
thr met arg ser phe lys arg arg phe phe his leu ile gin leu gly asp gly ser tyr 

421/141 451/151 

AAT TTG AAT TTT TAT AAA GAT GAA AAG ATC TCC AAA GAA CCA AAA GGA TCA ATA TTT CTG 
asn leu asn phe tyr lys asp glu lys ile ser lys glu pro lys gly ser ile phe leu 

481/161 511/171 

GAT TCC TGT ATG GGT GTC GTT CAG AAC AAC AAA GTC AGG CGT TTT GCT TTT GAG CTC AAG 
asp ser cys met gly val val gin asn asn lys val arg arg phe ala phe glu leu lys 
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541/181 571/191 

ATG CAG GAC AAA AGT AGT TAT CTC TTG GCA GCA GAC AGT GAA GTG GAA ATG GAA GAA TGG 
met gin asp lys ser ser tyr leu leu ala ala asp ser glu val glu met glu glu trp 

601/201 631/211 

ATC ACA ATT CTA AAT AAG ATC CTC CAG CTC AAC TTT GAA GCT GCA ATG CAA GAA AAG CGA 
ile thr ile leu asn lys ile leu gin leu asn phe glu ala ala met gin glu lys arg 

661/221 691/231 

AAT GGC GAC TCT CAC GAA GAT GAT GAA CAA AGC AAA TTG GAA GGT TCT GGT TCC GGT TTA 
asn gly asp ser his glu asp asp glu gin ser lys leu glu gly ser gly ser gly leu 

721/241 751/251 

GAT AGC TAC CTG CCG GAA CTT GCC AAG AGT GCA AGA GAA GCA GAA ATC AAA CTA AAA AGT 
asp ser tyr leu pro glu leu ala lys ser ala arg glu ala glu ile lys leu lys ser 

781/261 811/271 

GAA AGC AGA GTC AAA CTT TTT TAT TTG GAC CCA GAT GCC CAG AAG CTT GAC TTC TCA TCA 
glu ser arg val lys leu phe tyr leu asp pro asp ala gin lys leu asp phe ser ser 

841/281 871/291 

GCT GAG CCA GAA GTG AAG TCA TTT GAA GAG AAG TTT GGA AAA AGG ATC CTT GTC AAG TGC 
ala glu pro glu val lys ser phe glu glu lys phe gly lys arg ile leu val lys cys 

901/301 931/311 

AAT GAT TTA TCT TTC AAT TTG CAA TGC TGT GTT GCC GAA AAT GAA GAA GGA CCC ACT ACA 
asn asp leu ser phe asn leu gin cys cys val ala glu asn glu glu gly pro thr thr 

961/321 991/331 

AAT GTT GAA CCT TTC TTT GTT ACT CTA TCC CTG TTT GAC ATA AAA TAC AAC CGG AAG ATT 
asn val glu pro phe phe val thr leu ser leu phe asp ile lys tyr asn arg lys ile 

1021/341 1051/351 

TCT GCC GAT TTC CAC GTA GAC CTG AAC CAT TTC TCA GTG AGG CAA ATG CTC GCC ACC ACG 
ser ala asp phe his val asp leu asn his phe ser val arg gin met leu ala thr thr 

1081/361 1111/371 

TCC CCG GCG CTG ATG AAT GGC AGT GGG CAG , AGC CCA TCT GTC CTC AAG GGC ATC CTT CAT 
ser pro ala leu met asn gly ser gly gin ser pro ser val leu lys gly ile leu his 

1141/381 ' 1171/391 ? 

GAA GCC GCC ATG CAG TAT CCG AAG CAG GGA ATA TTT TCA GTC ACT TGT CCT CAT CCA GAT 
glu ala ala met gin tyr pro lys gin gly ile phe ser val thr cys pro his pro asp 

1201/401 1231/411 

ATA TTT CTT GTG GCC AGA ATT GAA AAA GTC CTT CAG GGG AGC ATC ACA CAT TGC GCT <3AG 
ile phe leu val ala 'arg ile glu lys val leu gin gly ser ile thr his cys ala glu 

1261/421 1291/431 

CCA TAT ATG AAA AGT TCA GAC TCT TCT AAG GTG GCC CAG AAG GTG CTG AAG AAT GCC AAG 
pro tyr met lys ser ser asp ser ser lys val ala gin lys val leu lys asn ala lys 

1321/441 1351/451 

CAG GCA TGC CAA AGA CTA GGA CAG TAT. AGA ATG CCA TTT GCT TGG GCA GCA AGG ACA TTG 
gin ala cys gin arg leu gly gin tyr arg met pro phe ala trp ala ala arg thr leu 
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1381/461 - 1411/471 

TTT AAG GAT GCA TCT GGA AAT CTT GAC AAA AAT GCC AGA TTT TCT GCC ATC -TAC AGG CAA 
phe lys asp ala ser gly asn leu asp lys asn ala arg phe ser ala ile tyr arg gin 

1441/481 . 1471/491 

GAC AGC AAT AAG CTA TCC AAT GAT GAC ATG CTC AAG TTA CTT GCA GAC TTT CGG AAA CCT 
asp ser asn lys leu ser asn asp asp met leu lys leu leu ala asp phe arg lys pro 

1501/501 1531/511 

GAG AAG ATG GCT AAG CTC CCA GTG ATT TTA GGC AAT CTA GAC ATT ACA ATT GAT AAT GTT 
glu lys met ala lys leu pro val ile leu gly asn leu asp ile thr ile asp asn val 

1561/521 1591/531 

TCC TCA GAC TTC CCT AAT TAT GTT AAT TCA TCA TAC ATT CCC ACA AAA CAA TTT GAA ACC 
ser ser asp phe pro asn tyr val asn ser ser tyr ile pro thr lys gin phe glu thr 

1621/541 1651/551 

TGC AGT AAA ACT CCC ATC ACG TTT GAA GTG GAG GAA TTT GTG CCC TGC ATA CCA AAA CAC 
cys ser lys thr pro ile thr phe glu val glu glu phe val pro cys ile pro lys his 

1681/561 1711/571 

ACT CAG CCT TAC ACC ATC TAC ACC AAT CAC CTT TAC GTT TAT CCT AAG TAC TTG AAA TAC 
thr gin pro tyr thr ile tyr thr asn his leu tyr val tyr pro lys tyr leu lys -tyr 

1741/581 1771/591 

GAC AGT CAG AAG TCT TTT GCC AAG GCT AGA AAT ATT GCG ATT TGC ATT GAA TTC AAA GAT 
asp ser gin lys ser phe ala lys ala arg asn ile ala ile cys ile glu phe lys asp 

1801/601 1831/611 

TCA GAT GAG GAA GAC TCT CAG CCC CTT AAG TGC ATT TAT GGC AGA CCT GGT GGG CCA GTT 
ser asp glu glu asp ser gin pro leu lys cys ile tyr gly arg pro gly gly pro val 

1861/621 1891/631 

TTC ACA AGA AGC GCC TTT GCT GCA GTT TTA CAC CAT CAC CAA AAC CCA GAA TTT TAT GAT 
phe thr arg ser ala phe ala ala val leu his his his gin asn pro glu phe tyr asp 

1921/641 1951/651 

GAG ATT AAA ATA GAG TTG CCC ACT CAG CTG CAT GAA AAG CAC CAC CTG TTG CTC ACA TTC 
glu ile lys ile glu leu pro thr gin leu his glu lys his his leu leu leu thr phe 

1981/661 2011/671 

TTC CAT GTC AGC TGT GAC AAC TCA AGT AAA GGA AGC ACG AAG AAG AGG GAT GTC GTT GAA 
phe his val ser cys asp asn ser ser lys gly ser thr lys lys arg asp val val glu 

2041/681 2071/691 

ACC CAA GTT GGC TAC TCC TGG CTT CCC CTC CTG AAA GAC GGA AGG GTG GTG ACA AGC GAG 
thr gin val gly tyr ser trp leu pro leu leu lys asp gly arg val val thr ser' glu 

2101/701 2131/711 

CAG CAC ATC CCG GTC TCG GCG TAC CTT CCT TCG GGC CAT CTT GGC TAC CAA GAG CTT GGG 
gin his ile pro val ser ala tyr leu pro ser gly his leu gly tyr gin glu leu gly 

2161/721 2191/731 

ATG GGC AGG CAT TAT GGT CCG GAA ATT AAA TGG GTA GAT GGA GGC AAG CCA CTG CTG AAA 
met gly arg his tyr gly pro glu ile lys trp val asp gly gly lys pro leu leu lys 


FIG. 1 1 A 
3 of 8 


JNSOOCID: <WO. 


0231117A2_I_> 


WO 02/311 17 


89/114 


PCT/USO 1/32 202 


2221/741 

ATT TCC ACT CAT CTG GTT TCT ACA GTG TAT 
ile ser the his leu val ser thr val tyr 

2281/761 

CAG TAC TGT CAG AAA ACC GAA TCT GGA GCC 
gin tyr eye gin lys thr glu ser gly ala 

2341/781 

CTT AAG AGT CTG CAT GCG ATG GAA GGC CAC 
leu lys ser 1<mi his ala met glu gly his 

2401/801 

AAC CAG CTG TTC CGA GTC CTC ACC AGA GCC 
asn gin leu phe ara val leu thr arg ala 

2461/821 

CGG GTC ATT AT" CAT GTG GTT GCC CAG TGC 
arg val ile ile r.is val val ala gin cys 

2521/841 

TCA TAT GTT AAG TAC GCG TAT AAG GCT GAG 
ser tyr val lys tyr ala tyr lys ala glu 

2581/861 

CAT GAA GAA CTG ACC AAA TCC ATG ACC ACG 
his glu glu leu thr lys ser met thr thr 

2641/881 

AGC AAC AAA CTA CTG AGG TAC TCA TGG TTT 
ser asn lys leu leu arg tyr ser trp phe 

2701/901 

CAG CAT TTG ATA GAG AAC TCC AAA GTT AAG 
gin his leu ile glu asn ser lys val lys 

2761/921 

TAT CAT CAT GCA GCG GAA ACC GTT GTA AAT 
tyr his his ala ala glu thr val val asn 

2821/941 

GGA GAT AAT CCA GAG GCA TCT AAG AAC GCG 
gly asp asn pro glu ala ser lys asn ala 

2881/961 

TGT TTC ACC TTC ATG GAC AGG GGC TTT GTC 
cys phe thr phe met asp arg gly phe val 

2941/981 

TTT GCT CCT GGA GAC CCA AAG ACC CTC TTT 
phe ala pro gly asp pro lys thr leu phe 

3001/1001 

TGC AAC CAT GAA CAT TAT ATT CCG TTG AAC 
cys asn his glu his tyr ile pro leu asn 


2251/751 

ACT CAG GAT CAG CAT TTA CAT AAT TTT TTC 
thr gin asp gin his leu his asn phe phe 

2311/771 

CAA GCC TTA GGA AAC GAA CTT GTA AAG TAC 
gin ala leu gly asn glu leu val lys tyr. 

2371/791 

GTG ATG ATC GCC TTC TTG CCC ACT ATC CTA 
val met ile ala phe leu pro thr ile leu 

2431/811 

ACA CAG GAA GAA GTC GCG GTT AAC GTG ACT 
thr gin glu glu val ala val asn yal thr 

2491/831 

CAT GAG GAA GGA TTG GAG AGC CAC TTG AGG 
his glu glu gly leu glu ser his leu arg 

2551/851 

CCA TAT GTT GCC TCT GAA TAC AAG ACA GTG 
pro tyr val ala ser glu tyr lys thr val 

2611/871 

ATT CTC AAG CCT TCT GCC GAT TTC CTC ACC 
ile leu lys pro ser ala asp phe leu thr 

2671/891 

TTC TTT GAT GTA CTG ATC AAA TCT ATG GCT 
phe phe asp val leu ile lys ser met ala 

2731/911 

TTG CTG CGA AAC CAG AGA TTT CCT GCA TCC 
leu leu arg asn gin arg phe pro ala ser 

2791/931 

ATG CTG ATG CCA CAC ATC ACT CAG AAG TTT 
met leu met pro his ile thr gin lys phe 

2851/951 

AAT CAT AGC CTT GCT GTC TTC ATC AAG AGA 
asn his ser leu ala val phe ile lys arg 

2911/971 

TTC AAG CAG ATC AAC AAC TAC ATT AGC TGT 
phe lys gin ile asn asn tyr ile ser cys 

2971/991 

GAA TAC AAG TTT GAA TTT CTC CGT GTA GTG 
glu tyr lys phe glu phe leu arg val val 

3031/1011 

TTA CCA ATG CCA TTT GGA AAA GGC AGG ATT 
leu pro met pro phe gly lys gly arg ile 
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3061/1021 

CAA AGA TAC CAA GAC CTC CAG CTT GAC TAC 
gin arg tyr gin asp leu gin leu asp tyr 

3121/1041 

TTC TTG GTG GGA CTG TTA CTG AGG GAG GTG 
phe leu val gly leu leu leu arg glu val 

3181/1061 

CGT CTG ATC GCC ATC AGT GTG CTC AAG AAC 
arg leu ile ala ile ser val leu lys asn 

3241/1081 

TAT GCT TCA AGG AGC CAT CAG GCA AGG ATA 
tyr ala ser arg ser his gin ala arg ile 

3301/11C1 

CTG ATT GAA AAC GTC CAG CGG ATC AAT GTG 
leu ile glu asn val gin arg ile asn val 

3361/1121 

GGC ATG ACC GTG AAG GAT GAA TCC CTG GCT 
gly met thr val lys asp glu ser leu ala 

3421/1141 

CAG AAG GGA AGC ACC CTG GAC AAC AGC CTG 
gin lys gly ser thr leu asp asn ser leu 

3481/1161 

ATT GCT TCT CCA TAT ACA ACC TCA ACT CCA 
ile ala ser pro tyr thr thr ser thr pro 

3541/1181 

AGA GGA TCT CTC ATA AGC ACA GAT TCG GGT 
arg gly ser leu ile ser thr asp ser gly 

3601/1201 

AGC AAT TCC CTG GAT AAG CAC CAA CAA AGT 
ser asn ser leu asp lys his gin gin ser 

3661/1221 

GAT AAA CTT GAC CAG TCT GAG ATT AAG AGC 
asp lys leu asp gin ser glu ile lys ser 

3721/1241 

AGC ATG TCT GAT GAT GCT TTG TTT ACA TAT 
ser met ser asp asp ala leu phe thr tyr 

3781/1261 

GAT TTT TTT ACA ATA TCT GAA GTC TGC CTG 
asp phe phe thr ile ser glu val cys leu 

3841/1281 

ATA GCC AGG AAC CAG GAG GGG TTG GGA CCC 
ile ala arg asn gin glu gly leu gly pro 


3091/1031 

TCA TTA 1 ACA GAT GAG TTC TGC AGA AAC CAC' 
ser leu thr asp glu phe cys *arg asn his 

3151/1051 

GGG ACA GCC CTC CAG GAG TTC CGG GAG GTC 

gly thr ala leu gin glu phe arg glu val 
3211/1071 

CTG CTG ATA AAG CAT TCT TTT GAT GAC AGA 
leu leu ile lys his ser phe asp asp . arg 

3271/1091 

GCC ACC CTC TAC CTG CCT CTG TTT GGT CTG 
ala thr leu tyr leu pro leu phe gly leu 

3331/1111 

AGG GAT GTG TCA CCC TTC CCT GTG AAC GCG 
arg asp val ser pro phe pro val asn ala 

33 91/1131 

CTA CCA GCT GTG AAT CCG CTG GTG ACG CCG 
leu pro ala val asn pro leu val thr pro 

3451/1151 

CAC AAG GAC CTG CTG GGC GCC ATC TCC GGC 
his lys asp leu leu gly ala ile ser gly 

3511/1171 

AAC ATC AAC AGT GTG AGA AAT GCT GAT TCG 
asn ile asn ser val arg asn ala asp ser 

3571/1191 

AAC AGC CTT CCA GAA AGG AAT AGT GAG AAG 
asn ser leu pro glu arg asn ser glu lys 

3631/1211 

AGC ACA TTG GGA AAT TCC GTG GTT CGC TGT 
ser thr leu gly asn ser val val arg cys 

3691/1231 

CTA CTG ATG TGT TTC CTC TAC ATC TTA AAG 
leu leu met cys phe leu tyr ile leu lys 

3751/1251 

TGG AAC AAG GCT TCA ACA TCT GAA CTT ATG 
trp asn lys ala ser thr ser glu leu met 

3811/1271 

CAC CAG TTC CAG TAC ATG GGG AAG CGA TAC 
his gin phe gin tyr met gly lys arg tyr 

3871/1291 

ATA GTT CAT GAT CGA AAG TCT CAG ACA TTG 
ile val his asp arg lys ser gin thr leu 
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3901/1301 3931/1311 

CCT GTT TCC CGT AAC AGA ACA GGA ATG ATG CAT GCC AGA TTG CAG CAG CTG _ GGC AGC CTG 
pro val ser arg asn arg thr gly met met his ala arg leu gin gin leu gly . ser leu 

3961/1321 3991/1331 

GAT AAC TCT CTC ACT TTT AAC CAC AGC TAT GGC CAC TCG GAC GCA GAT GTT CTG CAC CAG 
asp asn ser leu thr phe asn his ser tyr gly his ser asp ala asp val leu his gin 

4021/1341 4051/1351 

TCA TTA CTT GAA GCC AAC ATT GCT ACT GAG GTT TGC CTG ACA GCT CTG GAC ACG CTT TCT 
ser leu leu glu ala asn ile ala thr glu val cys leu thr ala leu asp thr leu ser 

4081/1361 4111/1371 

CTA TTT ACA TTG GCG TTT AAG AAC CAG CTC CTG GCC GAC CAT GGA CAT AAT CCT CTC ATG 
leu phe thr leu ala phe lys asn gin leu leu ala asp his gly his asn pro leu met 

4141/1381 4171/1391 • 

AAA AAA GTT TTT GAT GTC TAC CTG TGT TTT CTT CAA AAA CAT CAG TCT GAA ACG GCT TTA 
lys lys val phe asp val tyr leu cys phe leu gin lys his gin ser glu thr ala leu 

4201/1401 4231/1411 

AAA AAT GTC TTC ACT GCC TTA AGG TCC TTA ATT TAT AAG TTT CCC TCA ACA TTC TAT GAA 
lys asn val phe thr ala leu arg ser leu ile tyr lys phe pro ser thr phe tyr glu 

4261/1421 4291/1431 

GGG AGA GCG GAC ATG TGT GCG GCT CTG TGT TAC GAG ATT CTC AAG TGC TGT AAC TCC AAG 
gly arg ala asp met cys ala ala leu cys tyr glu ile leu lys cys cys asn ser lys 

4321/1441 4351/1451 

CTG AGC TCC ATC AGG ACG GAG GCC TCC CAG CTG CTC TAC TTC CTG ATG AGG AAC AAC TTT 
leu ser ser ile arg thr glu ala ser gin leu leu tyr phe leu met arg asn asn phe 

4381/1461 4411/1471 

GAT TAC ACT GGA AAG AAG TCC TTT GTC CGG ACA CAT TTG CAA GTC ATC ATA TCT GTC AGC 
asp tyr thr gly lys lys ser phe val arg thr his leu gin val ile ile ser val ser 

4441/1481 4471/1491 

CAG CTG ATA GCA GAC GTT GTT GGC ATT GGG GAA ACC AGA TTC CAG CAG TCC CTG TCC ATC 
gin leu ile ala asp val val gly ile gly glu thr arg phe gin gin ser leu ser ile 

4501/1501 4531/1511 

ATC AAC AAC TGT GCC AAC AGT GAC CGG CTT ATT AAG CAC ACC AGC TTC TCC TCT GAT GTG 
ile asn asn cys ala asn ser asp arg leu ile lys his thr ser phe ser ser asp val 

4561/1521 4591/1531 

AAG GAC TTA ACC AAA AGG ATA CGC ACG GTG CTA ATG GCC ACC GCC CAG ATG AAG GAG CAT 
lys asp leu thr lys arg ile arg thr val leu met ala thr ala gin met lys glu his 

4 621/1541 4 651/1551 

GAG AAC GAC CCA GAG ATG CTG GTG GAC CTC CAG TAC AGC CTG GCC AAA TCC TAT GCC AGC 
glu asn asp pro glu met leu val asp leu gin tyr ser leu ala lys ser tyr ala ser 

4681/1561 4711/1571 

ACG CCC GAG CTC AGG AAG ACG TGG CTC GAC AGC ATG GCC AGG ATC CAT GTC AAA AAT GGC 
thr pro glu leu arg lys thr trp leu asp ser met ala arg ile his val lys asn gly 
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4741/1581 

GAT CTC TCA GAG GCA GCA ATG TGC TAT GTC 
asp leu ser glu ala ala met cys tyr val 

4801/1601 

ACA CGG AAA GGC GTG TTT AGA CAA GGA TGC 
thr arg lys gly val phe arg gin gly cys 

4861/1621 

GAC GAG GAG GCC TCC ATG ATG GAA GAC GTG 
asp glu glu ala ser met met glu asp val 

4921/1641 

GTG CTG ATG GAG CTC CTT GAG CAG TGC GCA 
val leu met glu leu leu glu gin cys ala 

4981/1661 

CTC ATC GCC GAC ATC TAC AAA CTT ATC ATC 
leu ile ala asp ile tyr lys leu ile ile 

5041/1681 

GAA GAT GAA GAT GGA AAG GAG TAT ATT TAC 
glu asp glu asp gly lys glu tyr ile tyr 

5101/1701 

ATT TCT CAG AGA CTC CTT AAA CTG TAC TCG 
ile ser gin arg leu leu lys leu tyr ser 

5161/1721 

ATA CAG GAT TCT GGC AAG GTC AAC CCT AAG 
ile gin asp ser gly lys val asn pro lys 

5221/1741 

GTG ACT CAC GTC ATC CCC TTC TTT GAC GAA 
val thr his val ile pro phe phe asp glu 

5281/1761 

GAG AGA TCC CAC AAC ATC CGC CGC TTC ATG 
glu arg ser his asn ile arg arg phe met 

5341/1781 

AGG CAG GGC GGG GTG GAA GAG CAG TGC AAA 
arg gin gly gly val glu glu gin cys lys 

5401/1801 

TTC CCT TAT GTG AAG AAG CGC ATC CCT GTC 
phe pro tyr val lys lys arg ile pro val 

5461/1821 

ATC GAG GTG GCC ATT GAC GAG ATG AGT AAG 
ile glu val ala ile asp glu met ser lys 

5521/1841 

TCG GCC GAG GTG GAC ATG ATC AAA CTG CAG 
ser ala glu val asp met ile lys leu gin 


4771/1591- 

CAC GTA ACA GCC CTA GTG GCA GAA" TAT CTC 
his val thr ala leu val ala glu tyr leu 

4831/1611 

ACC GCC TTC AGG GTC ATT ACC CCA AAC ATC 
thr ala phe arg val ile thr pro asn ile 

4891/1631 . 

GGG ATG CAG GAT GTC CAT TTC AAC GAG GAT 
gly met gin asp val his phe asn glu asp 

4951/1651 

GAT GGA CTC TGG AAA GCC GAG CGC TAC GAG 
asp gly leu trp lys ala glu arg tyr glu 

5011/1671 

CCC ATT TAT GAG AAG CGG AGG GAT TTC TTT 
pro ile tyr glu lys arg arg asp phe phe 

5071/1691 

AAG GAA CCC AAA CTC ACA CCG CTG TCG GAA 
lys glu pro lys leu thr pro leu ser glu 

5131/1711 

GAT AAA TTT GGT TCT GAA.AAT GTC AAA ATG 
asp lys phe gly ser glu asn val lys met 

5191/1731 

GAT CTG GAT TCT AAG TAT GCA TAC ATC CAG 
asp leu asp ser lys tyr ala tyr ile gin 

5251/1751 

AAA GAG TTG CAA GAA AGG AAA ACA GAG TTT 
lys glu leu gin glu arg lys thr glu phe 

5311/1771 

TTT GAG ATG CCA TTT ACG CAG ACC GGG AAG 
phe glu met pro phe thr gin thr gly lys 

5371/1791 

CGG CGC ACC ATC CTG ACA GCC ATA CAC TGC 
arg arg thr ile leu thr ala ile his cys 

5431/1811 

ATG TAC CAG CAC CAC ACT GAC CTG AAC CCC 
met tyr gin his his thr asp leu asn pro 

5491/1831 

AAG GTG GCG GAG CTC CGG CAG CTG TGC TCC 
lys val ala glu leu arg gin leu cys ser 

5551/1851 

CTC AAA CTC CAG GGC AGC GTG AGT GTT CAG 
leu lys leu gin gly ser val ser val gin 
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5581/1861 

GTC AAT GCT GGC CCA CTA GCA TAT GCG CGA 
val asn ala gly pro leu ala tyr ala arg 

5641/1881 

TAT CCT GAC AAT AAA GTG AAG CTG CTT AAG 
tyr pro asp asn lys val lys leu leu lys 

5"01/1901 

GGT CAA GCC TTA GCG GTA AAC GAA CGT CTG 
gly gin ala leu ala val asn glu arg leu 

5761/1921 

CAA ATG AAA GCC AAC TAC AGG GAA ATG GCG 
glu met lys ala asn tyr arg glu met ala 

5821/1941 

ATC TGC CCC CTG GAG GAG AAG ACG AGC GTC 
ile cys pro leu glu glu lys thr ser val 

5881/1961 

ATC AGT GGG ACT CCA ACA AGC ACA ATG GTT 
ile ser gly thr pro thr ser thr met val 


5941/1981 

TGA TTA CAT CTC ATG GCC CGT GTG TGG GGA 

.. : A 


6001 

TTT CCA AAG CCA ATC ACT GGG GAG ACC GAG 
6061 

AAG GAA ATA AAG AAC AAC GTT ATT TCT TAA 
6121 

CAC ATA TTT TTT TAA ATC TCA CTG GCA ATA 
6181 

TGT GGT AGA CAC TCT TGA GCT GGA CTT AGA 
6241 

ATA GAT GGC CTA CAG AAA AAA AAG GTT CTG 
6301 

CAT TGA TGC CTG GGG GAC CTT TTG CCT CGA 
6361 

TAC AGA ACT TAC TAG TTT TGT CTA GGA GTA 
6421 

TCA TTC AAC AAC ATA GAG CAA GAA TAG TGA 
6481 

CTA CTG GCT TCA AGT CAG AAC TTT GTC ATT 
6541 

ATT ACA TTT CTA CAT TTT TAA TAC TCA CAT 
6601 

ATT TGT GCT GGT CCA GTA TAT GCA ATA CAC 
6661 

GCA ATA TGG AGA TGT ATA CAA GTC TTT ACT 


5611/1871 

GCT TTC TTA GAT GAT ACA AAC -ACA AAG CGA 
ala phe leu asp asp thr asn thr lys arg 

5671/1891 

GAA GTT TTC AGG CAA TTT GTG GAA GCT TGC 
glu val phe arg gin phe val glu ala cys 

5731/1911 

ATT AAA GAA GAC CAG CTC GAG TAT CAG GAA 
ile lys glu asp gin leu glu tyr gin glu 

5791/1931 

AAG GAG CTT TCT GAA ATC ATG CAT GAG CAG 
lys glu leu ser glu ile met his glu gin 

5851/1951 

TTA CCG AAT TCC CTT CAC ATC TTC AAC GCC 
leu pro asn ser leu his ile phe asn ala 

5911/1971 

CAC GGG ATG ACC AGC TCG TCT TCG GTC GTG 
his gly met thr ser ser ser ser val val 

5971 

CTT GCT TTG TCA TTT GCA AAC TCA GGA TGC 


6031 

CAC AGG GAG GAC CAA GGG GAA GGG GAG AGA 
6091 

CAG ACT TTC TAT AGG AGT TGT AAG AAG GTG 
6151 

TTC AAA GTT TTC ATT GTG TCT TAA CAA AGG 
6211 

TTT TAT TCT TCC TTG CAG AGT AGT GTT AGA 
6271 

GGA TCT ACA TGG CAG GGA GGG CTG CAC TGA 
6331 

CTC GTG CCG GAA ATC TGA TCG TAA TCA GGG 
6391 

TGT TGT ATG ACT AGG ATT TGT GCT ATT ATC 
6451 

GCT AAC TGA GCT AGA CAC TCA ATT AAT CCG 
6511 

AAT CAT CGA CTC CGG GAC GGT CAT ATA TGT 
6571 

GGG CTT ATG CAT TAA GTT TAA TTG TGA TAA 
6631 

TTT AAT GGT TTA TTC TTG TCA TAA AAA TGT 
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A. Allelic variations: single nucleotide changes (polymorphism) between CLASP-2 cDNA 
isoforms 


Isofoim 

Difference 

Nucleotide(s) 

Consequence 

1 

polymorphism 

862 

A to G change; mis-sense 




mutation 

2 

polymorphism 


A to C change; mis-sense 




mutation changing codon from 




histidine to proline 

3 

polymorphism 

2210 

A to G change; mis-sense 




mutation changing codon from 




aspargine to glutamic acid 

4 

polymorphism 

2225 

C to T change; mis-sense 




mutation changing codon from 




histidine to tyrosine 

■B. Alternative splices 

Isoform 

Difference 

Nucleotide(s) 

Consequence 

1 

exon deletion 

209-291 

premature, in- frame stop codon 




leading to the production of a 




truncated, most likely soluble 




protein 


These differences may be found separately or together in various combinations in the different 
human CLASP-2 isoforms 
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1st exon (nucleotides 335 to 445) 

TGTTTG^TT A TrrTTTrnrrrTrr a anr a a a arc, a a a GCTAATTG AGCCACT 
CGACTATGAAAATGTCATCGTCCAGAAGAAGACTCAGATCC TGAACGACTGT 
TTACGGGAGATGCTGCTCTTCCCTTACGATGACTTTCAGG TAAGTAACGTTAT 

QTTTCTATCCGTAGAACCACG 
2nd exon (nucleotides 7101-7190) 

TTA CC* a a r T r.rTTTTrrTrrTrTTTTTTGTTTCr AG ACGGCCATCCTGAGACGA 
CAGGGTCGATACATATGCTCAACAGTGCCTGCGAAGGCGGAAGAGGAAGCA 
CAGAGCTTOTTTGTTACAGAG GTAAGGCTCTTTCCTGCATTAATTTACATTTT 

GAAGTCATTTTCCCCTAACTGCCTCC 
3rd exon (nucleotides 1 1439 to 11521) 

TTTTTT A TTT* I * l" A A A A I TranTTTCA ATAG TGCATCAAAACCTATAACTCTGAC 

TGGCATCTTGTGAACTATAAATATGAAGATTACTCAGGAG AGTTTCGACAGC 

TTCCGAAGTGAGTAAGCTATATTATACACATAGGGAAAAGTCTTT 


4th exon (nucleotides 13987 to 14056) 

r^TA a a apa A ATTTTrTTTfiTTaTTTTTATAn CAAAGTGGTC AAGTTGGATAAA 
CTTCCAGTTCATGTCTATGAAGTTGACGAGGAGGTCGACAAAGATGAGG TGG 
GATACCTGCTTGCTGTTGCTTCTCTTTTCACTCTAGATTTAA 

5th exon (nucleotides 15212 to 15307) 

qGA r.r.TTn a rTrw-Tr.r.TnTTTTrrTTrTrTr.CTAG GATGCTGC CTCCCTTGGTT 

rcCAGAAGGGTGGGATCACCAAGCATGGCTGGCTGTAC AAAGGCAACATGA 

ACAGTGCCATCAGCGTGACCATGAGG GTGAGGACGCACATCACTTTGCCCTC 

CCCTCTCACAAGCCCTTTC 

6th exon (nucleotides 16269 to 16404) 

TG AAA ^ a a x A nc TfynTTfYT ATA TTTTTC1TCTC AGTC ATTT AA G AG ACG ATTTT 
TrCACCTGATTCAACTTGGCGATGGATCCTATAATTTGAATT TTTATAAAGAT 
fiAAA AGATCTCCAAAGAACCAAAAGGATCAATATTTCTGGA TTCCTGTATGG 
nTGTCGTTCAGG TAAATATGAAAAGAGTTTTACCATTATGTTTTCTTA 

7th exon (nucleotides 19459 to 19633) 

Aj^QTATrrrPTflTTTATCPTTTTTTr.AT^ 

GrTTTTGAGCTCAAGATGCAGGACAAAAGTAGTTATCTCTT GGCAGCAGACA 
nTGAAGTGGAAATGGAAGAATGGATCACAATTCTAAATAA GATCCTCCAGCT 
PA ArTTTGAAGCTGCAATGCAAGAAAAGCGAAATGGCG ACTCTCACGAAGGT 
AGATAGGCTTGGCTTCCCCCAGGCACATACACACTCT 

8th exon (nucleotides 20567 to 20634) 
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ATTACAAGTGATTCCGATAATCTGTTTTGCCATTTTAG a to atg aac aaagc a 

AATTGGAArTrTTTCTGGTTrrnGTTTArtATAOCTA rrTrTCCGGAACTTGCCAAG 

gtaacaTcgtcttatatcttctgctcttcgttgaatgc 

9th exon (nucleotides 30257 to 3033 1) 

G ATTCTCTTAAATOTAATTTT'" A A TrT-rm-r atp a n A GTGCAAGAGAAGC 
AfiAA ATC AA A CT AAAAAOTG AAAGCAG A GTC A AACT TTTTTATTTGGACCC A 
GATGCCCAGG TAAGAACTATCTAAATGTTTAATATTTAAAACCAAAT 

10th exon (nucleotides 3185i to 31991) 

C ATAACTTATTTA T A TOTTT A r ' a TTTTrTTTT a aap,aa rTTTTOACTTCTCATCA 
GCTGAGCC AO AAGTG A AGTC ATTTGAAGAGA AGTTTGG A AAAAGGATCCTTG 
TGAAGTGCAATGATTTATCTTTCAATTTGCAATO CTGTGTTGCCGAAAATGAA 

OAAGOACCr.ACTACAAATG TAATTTTTCATTTTAAAAATAAACATTAAAAAA 
AAAATAGGCAG 

1 1 th exon (nucleotides 32472 to 32675) _ mWTV , „ A 

CCATG GTG ATC ATTO^r A TTrvrTTTriTTTTriTTf; A O GTTGAACC 1 ' 1 1U 1 1 iGTTA 
CTCT ATCCf'.TGTTTGA C AT A A A AT AC A A CCGG A AGATTTC -TGCCGATTTCC AC 
GTAGACCTOAACCATTTCTCAGTGAGGCAAATGCTCGCCACCACGTCCCCGG 

CGCTGATG A ATGGCAOTGGGC AG AGCCC ATC TGTCCTCA AGGGCATCCTTC A 
TGAACjCCOCCATGCAOTATCCGAAGCAG GTGGGGAGTATGAGCCCAGCATTC 

CCACTACTCAGACTCACTTTGCATGC 

1 2th exon (nucleotides 33063 to 33185) 

CAATTCTGCTTACTOAACTfl f\ A KTY^-r-rrnrrrrrTArtnaA ATATTTTCAGTCA 
(TTTGTCCTr.ATCCAGATATATTTCTTGTGGCCAGAATT OAAAAAGTCCTTCAG 
GGGAGCATC ACACATTOCGCTGAGCCATATATG A A A AGTTCAGACTCTTCTA 
AGGTATGAATGGCTTTTACGCTTTGGGGTGGTAAAAAGCAATCTGAA 

1 3th exon (nucleotides 36702 to 36784) . 
CACTATCTCATAGCTTT A TT r " rr> A t/tttttt a a nnmOTCC AG AAGGTGCTGA 
AGAATGCrAAGCAGGCATGCCAAAGACTAGG AGAGTATAGAATGCCATTTGC 

TTfiGGCAGCAAGG TAvAGGAACACCTTTTATACCTTTTAAATCGATATAGATA 
GGTGCATGG 

14th partial exon (nucleotides 37353 to 37475) 

C AAACCCAGTTTAOA A A Tcvrrnrrrrarr. atttc. AGGACATTGTTTAAGGATG. 
C. ATCTGG A A ATCTTGACAAAAATGCCAGATTTTCTGCCA TCTaCAGGCAAGA 
f! AGCAAT A A QCTATCC- A ATG ATGAC ATGCTC AA GTTACTTGC A GACTTTCGQ 

AA 
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XAAXG&CTGGAA6CTCCAGC3VGAA 


1 TACC^AGGGCJ^TCTGGCACACCCTAAAGTCTG^^ 

93 TGCAGAGCTTCCTTTGTGCTTAAATAACTGAAC^^ 

185 CTTAmK rrGITTCTTICTGaTA^ 

277 AfcGAAMGCXMTTCn^ 

369 tcaxcGTCCAG^GAAGACTCAGA^ 

4 61 TTTCTAITCCGTAGAACCA^ 

553 MTTTTTTTGA^CACAGTCT^ 

6 4 5 AGCATITCTCCTGCCTCAGCCTCTCAAGTAA^ 

737 GTTTTGCC^GGTGGCC^GGGTCGTCC 

829 GCCJM^GTGCCCGGCCrAATTATGGTTTTTA 

1013 GTGTATCTTGAAITAAAAGTGCrATACTCT 

1 1 05 TTTTTGGCCAACTACCTTAG? 1 JVrCTTAAT 

1197 AGCCTGCCTTG^GAGGCCCTGC3\CTCTGTCT 

1289 AATGTGGGTGCTGTTCCTTGGAAGAAATGTGGAGGACT^^ 

1381 ACTAAC7WiV\GCTGTG7\GGCTGTT^ 

1473 TTGGTCCATTTGGAGGTTCTGC^^ 

15 65 ACACITCTCTfiCCTGATGCTGTrACCATTa^^ 

1657 TOG p^cCAAAAGTGTTACATCCOT 

17 4 9 ATCCCAICCCAmGAACA^ 

1841 ATCCTTGAAGGTTITTTCCCACCAAATIT 

1933 j^G^ACXrTGGGCCCCAAAATGATT 

2025 TGCTACCAAAGTAACACGTAA 

2117 GAsCTAATAARTGTVGAACCTCTGAAT 

2209 TGTCACAACJOm 

ATTAAAGTAACAACTTTriTGAGTT^ 
ACGAGATACACTTACTITC^ 

2485 GAAAACTTCTGTGTC03CTTIXXTTTT^ 

2577 ^scTAAGCTTGATC^^ 

2669 * GGAATTGTGATGGGCCC CAGTGAAGTTTGGGTACAAT^ 

2761 GCTTTCTTGTTAAATTGTCTCAGTGATOT 

2853 TGTGTTTTCTTAfiGCAGAATAAGLAGCA 

2945 GAAAAGTAATACCAACTTGCH^^ 

3037 taacTATTAAAATAATAGAACTC 

3129 CTTGTCT^CArri^^ 

322 1 TGrCAAGAAAAGATTTGCGGGrTGCAlCTAGT^ 

3313 AAACCTGGAITTTTAAAAATCA 

3405 ACAAGGTCAGC^GTTTG3\GACCAJ3^ 

3497 CCTGm^CCAGCTACTCGGGAGGCTGAGGC^^ 

3589 TTCCAGCCTGGGCGACAGAGTGAS^CT 

3681 CTGGGCAO^GTGGCTCACACCroi^^ 

3 7 7 3 CAACATGGCAAAACCCCGTCTCTACTAAA 

3865 GCAAGAGAATCrcTC^ 

3957 J\XCTCAAAAAAAAAAAGGGAATATT^ 

4049 CTAGCTGATGGCCCTTCTTTTTC^^ 

4141 AGCATCTTCTCTTCTAGATCT 

4233 CCCACTGTCTCAATTCCTTItX^ACTGCC^ 

4325 GGCTTCTTGCJUICCACGCATCCCGCCCCCT 

4 4 17 CMGCACAAAATC CCmOT 

Ab09 TTCmGArrCATCAACTAAATOT 
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TTGGTTTGGC^G7W7rAGG^ 

CTTTATTmCTT^^ 

TTTAACAAGCWTTTAG^^ 

GCCTCTGITTAAC^^ 

ACGPJTFCTTCTTCA^ 

GGCTTTTAACKTrCCTTGTATC 

TAGTAAGAAn-GTTGGGAGACAATTTAAGCTAC 

TTGTATGAAAGC^^ 
GGCPSGGTGGCTCACGCCTGIPJ& 
^GGTGAAACCCCATCTCTAXT^^ 
TTGCTTGACTTCAGGAC^^ 

CCCGCCTGTAGTCCCAGCTACTCJySGlAGGCTGAA^ 
TGCACTCCAGCCTQ^GTrmC^ 
CTTTGGGAGGCCGAGGTGGGOy37VIC7^^ 
MAftTrAGrrGGGCATGGTGGCAGGC^ 

TTAAGCCTAftIX5AACACGA.GCA^ 

MCTTCTGTGTGTCTAGTTATCTTVTGCT 

GATGGrTTGCACATVTTGTGTGAATCT 

AATCCCAGCACTTTC^ 

CTAAAAATOCAAAATT^^ 

GGC(KAG^GCA^ 

TT^jtfVTTTATGCAT^^ 

GTGTTAGGGACTTTGAACGGGC^ 

TAACCr^CTGGCTTTGGAACTrGAGC^ 

C^PCTCTGTGGCTGCC^GGG^ 

CTGTlTrrTGTTTCCZAQVCGGCCATCCT^ 

TCTTTCTTACAGA^^ 

AATGTC CACAGGGGCTATTTCTTTTTTACA^ 
TTCCTOVC^TrTAT^ 
. TTTTAAGGCTGACACACACA^ 
GCK^ATAAXAAGCAGGGTCTATC 

ATTATTTTTOTTT^^ 

AGTCTTGCTCTGOTGTCTAA^ 

TCAGC^CCAP^AGCTGGGAITAC^^ 

y^SGCTGgrCTCAGACT^ 

CflGAAAGAGftAATTATTACA^ 

TAGATCATAAATATGCTCATCAATAPAATTC^ 
CCATTCAGAXTTXT ^^ 

CCTTTGCCACTATArlAGT^^ 

TGGCAAACTTOX^CCnXX^C^ 

TATTGCCIATGGCTACTGTCACCOT 

CACAAAA7y3TTTGTTGACCTA0X7n^ 

TTTGAGGGAGAACTCTCCATTAAGTTAT^ 

ACTTTOX3GACCCOTCCAOT 

(^GTCACArrcCTGTCrT^ 
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GGGTGGGGGTTGCre^ 

TGAATCTCTft?^;^^ 1 mTI l 1 1 1 1 1 1 IGT^G?TGCGATTCCTl^AA!ITArAAATAIA 

TTTGGCWiGCCGAGGCGGG 

TAATCTCCTGCAGGTUGACTTGGTGCT 

TCTGAAATCTO^CTGCn^TTGG^ 

GOCCTTCTCTtSCXOTTrATTCCMCTCT 
TTATTCTAGAGCCTG^^ 

TMAGCCAGAGCCAAATAAACTA^ 

GACACCMGCTGGCCCATTC^CTGTCM 

TCACTCTTTTCCrc^ 
M7\CCAXGAAGGCGTCTTTAAGTTCACG 

T GAA7\TAGTCCATTGTT^^ 

A7\GTC3\TACTACCCCCT\AGGGTA ■ 
TO(MGG GAAAAAArrrTTTTC^^ 

AGAGTTTCGACAGCTTCCGAAtTO^^ 

TCATATGCAATCAGA^^ 
CTCAAAJC^GCTACITI^AITG^ 

^j3XXAAATTTTTTTTTTATCrc 

MTTTCTAAAGAA^^ 
CACGAXCTCTCCTCACTG^ 

MCACGCCCGGCTATmCTMT^ 
CTGCCTCTGCCTCCCAAACTGCTGGGOT 

TCCC^TTGCCAGGAACAAGAATTCT^^ 
• ^ttatGCAACAACGGG^ " 
TTGATTTTGCX^AAATTGAAA^ 
TATTmCTTTCCGTTTGTTAC^ 
TTGCCJUirrCCTTAAGAATGTTCGGCCAAITCCCC 

OTGGCTTAGCCCTTTAGTTCTA^ 
GGCATTAATGAGAATrTCCAAM 

gactctck:cgtctgtc^ctgattttcctc^^ 

TCATTOTAATCGGITrC^^ 
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13801 ACTCA*CTCTCTTGCCTTAC^ 

13893 TACTtMGCCCncacCCrZXTZ^^ 

13985 AGCAAAX^TGTn-CAAGTTt^^ 

14077 TGCITC7CTTTTCAC7CTAG7a^ 

141 €9 GACTC7TAC\TC7JUU^^^^ 

14261 (TTTCTCTTC^rTCTCT^ 

14353 ACCTTt ^ATTTTJSCTTGTTTTTCAACAGM 

14445 AGTCTGATTTC^axZ^ArXXXACT^ 

14537 TGCCCACTJ ) CTCyNgC»VAACCT(^W 

14629 TTTAG^I 1 1 1 111 .Ki. * AAAAAAATTGTCftGCTGAC^TATAAC^ 

14721 CCCCCAIT*-TTT AT^TTCTCCTTT^ 

14813 CAGGAAP5TGCAT O : V*, : '1 H 1 1 IC V GACTATTUSGGGAiCTGrnilATTTGT^ 

1 4 90S GTGcnx^CTrc?jccrr?crrm 

1 4 9 97 TGGAATAAACrrrTTCASAAATC7AOOC^ 

15089 CTTCAGTZAJSttJ J^MJGC^^MTTTO^ 

15181 GTTGACTU- : OU T ^1 : ' I I C I : T w 1 L 1 11 1 AGG^GCTGCCTCCCTTGGTTCCCAG?^ 

15273 GCAACKTC2%AC*GTaX^\T:i^^ 

15365 CGAGAAOATOCTCW^r^AJVir^^ 

is 457 ccovrraxjkTAi^TCAircAc^ 

15549 AC/\CTGACAT7CX*rr^ 

15641 CHPGAjGCACTTCTTrrcr^TCT^ 

15733 GC^USACAGA/^wnt^AACCA^ 

15825 AAGG<^GTCA^VCATAAAAAXT^^ 

15917 TTCCCATTCTOTUW3C3*A5AAACa^ 

16009 TTGGGTAGTTATCTAWOrTCI^ 

16101 AGCTTCTTA^ATGCTJm J^JJJTCTIGACCCCAC^GTGACCTGGAGCA^ 

16193 GTCA3?\AGTCXX7TTATTCi*XAC^^ 

16285 TTTTTCCACCTC^TTCAPjrrrcCCGA 

1 6377 TCTGGftXTCCTCTr/JGGGTCTCCTTCAGGT^ 

1 64 69 GCAGATTTAAlXAAJsCACT^ 

16561 TGTA(3^^ATGGiAAO'TV\TAATGTA^ 

16653 AACAGGCTACTTAATTCAGATAATGAGAAT 

16745 TTCMAGTATCTO^CTC^^ 

16837 AGTTCM^CCCTGCTCXrrcCACT^ 

1 692 9 AGG£\IAATGGCAGTACCAAATATOT 

17021 TGGTC^GTAITTAGAXACTTTTGTTATCA^ 

17113 AAGA33ACAC^GAATCTArrrn*CTTGT 

17205 CTTITCTGAGGCXSVGCr 

17297 ATCATCTCTCCATCO^CATCCATC 

17 38 9 TTATAAGCATACftCAGGACAXG^ 
17481 . AA3ZAGAGTTIAAGGAGAAAGC 

1 75 73 CTCnTTAGTC^GCAATTCAC^^ 

17665 TSUlAGCAAAACrAAATGrnvrTT^ 

17757 GCCCAGGG^CAGATGTTTATAiCT7U3AA(rr 

17849 AAGTCTCTBXTTCAAAlTGTCm 

17941 TtflUTTTlMCTGTGTCTGTTTCT 

1 8 033 TAGTTGACTCCTarrcrGAAra 

18125 AGayg^GAAAJVCTGAGGCTCAGSVA^^ 

18217 CTATGCAGGGftAAGG£UK5&GAGAAAATTC^ 

18309 GTGGCTiJOTACTGAGCCCACAGCAC^O^ 
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TGCTO^GAATCACA^ 

GATTTTTXGTGGCTArGCGGGGGGCTGTCCTGCT^ 
TTGTGCCATGATCCTC^AGCTGAACCT 
CTCK5CAITCTATTGGCTTAAGGCC 
TCACHAC7IACCGAGATT 

AGTCACCCTCTTGCAGG^ " 
AAGTTCC7\GA!I7UVGTAAGCCAAA.C^^ 

CTGWGAAATAT(3VAAAOT^ 

ATGArXiAGGGGGCAAGCTKTTTAAA7\I^ 

TTCTA^TTCTTTTAAAGTTVTGTCTGTn^ 

^AAjTTyXTTATCTCTTGG^ 

TT<XCAGGTGGGTAT7W^AAGQ\GACCT 


' ^• r lA7^ATAACTT7\C TTCCCTGAC—^tTTTC GT, ; 


CCTGGGTTTGLAGTGCCATAAA^ 

AACCCTTACXnXj3\TTCCCAAC ' 

ACTItrn^GGACGGTGTCCMGCTAJACC^ 

CCAACTXH^CCCAG^GCATCCA^ 

GAAAIXCTyjGAATTACAAGTgOTCCGATJ^ 

AIAGCT7\CCTGCCGGAACTTGCCAAGGTA 

TCTTCC^GAATGGAATCTGTTGAAATCJ^TAA 

CAGXAJGATTTCAGGTCTGTCTTCTAAAAAJ^ 

AAOVCTGCfcTCA!^^ 
CTCCCTTACTCAGGGTAATAGACACGGTT^ 

GOTrCGOTCACTCTTACTGGAAAAA^ 

GCAAATCAACCATACTGCTACTTCCCAa^T^ 

TTATAGTTGAAGTTCTTTTTAAACACTGTCC 

ACTTTCATTTrTATATCTTTACAAA 

TACCATATGTATIXrrGCACrn^AAA^^ 

CATTTTTTTCTATTATAAACAAAGCT^ 

TATCTTTGTClTCriXrTAGCCCriTGCA^ 

CCACATCTTAAACACTAXAATA^ 

ATGGCTGTTCTCAAGTCTAAAATCTC 

TAGCACCTG^CTATTAAAGCTA 

jVTGTCTGTCJrATCATATTTTGCA^ 

ATTTGGGTITTAGTTTGTTGCAGCAGGTAATATATCT 

GMACCTAGGAGAAGATC^ 
CTCCTTCICTTACTACCCA^ 
ATGTATACATITATGTATATAXACT^ 
CAAGTTAGGTTAAGCCATm 

CAGGCTAGCTICTAOT 

AATCCCAGCACTTTGOT^ 
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23001 GftAG(X3\GJ\GAra 

23093 CGTTTTAATGAAATAAAAXOT 

23165 TTTGOTGGO\CCTGCC3*CC3\^^ 

23277 CTAACCTCAATWSATCCACCC^^ 

23369 AGTGAATTACATAATTIT^ 

234 61 GATMAAAAGATAXTTGTTTT^ 

23553 (^ACTTTGAIATCTCTATACftATGT 

23645 TGAAACATTTCTTACCAGGAGT^ 

23737 TACAAATACTTTTATTTAGGAA^ 

23829 GCACJO'GGGTATCTGTGGGCTTTGCCTTTTAAAGCCC^ 

2392 1 CCCCJ^CTTGAGCAAAAATG^ 

24013 TGRCCTGMAACTGTTGGTCATCCCATTAGGAAGGM 

24105 AGAGAACTGTGQV\GGAGTTTACCCA^ 

24197 CTGGGCa^\GTTa^GTA7VG3^^ 

24289 TTTTCTAATCAGCTCCTCT7\ACCTC CTTCATC CTGTTGC CCCGGCT^ 

24 381 ACAGGAAGTTGTTTCAGTGCA7^AAATAACT 

24473 AGGTTTTCAGAATATTTTCTA^^ 

2 4 5 65 TATTTACCC17W\ACTGGTTCTTTTC 

24657 TATAAATATTTTMTTTGJVTACACAA^ 

24749 AAACTTCATIACTAGTTATTTT^ 

24 841 AGTGTGAAG7\AAACCC^ 

24 933 CTTTGGTTTCCCCATGTGGTTCTGAAAGAGAAGTAGT^^ 

25025 GCAAATGAACTTATTGTTCCAGC^^ 

25117 TGAGOVTCAAGGGCCAIX^C^^ 

25209 TGCTTTCTTCCCTAGAXTCCAATCCACy^^ 

25301 TGTTGCATTTCTO^GCCTCCTTO 

25 3 93 CTCTTTAAATTAAAGftXTATATA^ 

25485 ACACfl lT <JTri£TGftIAAXTCATC 

25577 AAGACACAGAGTATGCCTAAAGCCTACMCMM 

25669 CTTGTCAGGC?rt7rGTCATTAGTGClT^ 

257 6 1 TCCTTGTCTTCTCAGCAAACACCAGTT^ 

25853 CGAAACCCAGACTAGTCACTGGG^ 

25945 AGACTGACCACAGGTTAI^^ 

26037 TGTATTTTCTCCACTCTTAGAIUX^ 

26129 TGCCGCAATAAGTAAAAATACCCAAAGrTTGTO 

26221 TTAAIAACCGCTTGCAATTCCXCTCT 

2 63 13 AATATC^a^TGATAIATACATr^^ 

26405 TCCCCCTTACttlACAIAX^ 

2 6 4 97 X^ lVr Tl U v lUT C3UGTTTTTAGCCTCT^ 

26589 CCCCAArGTGGAAXTTTrAAGTCA 

2 6681 AirrCTGTTTCTCTGCArCTTCACCAGACGAGT^ 

2 677 3 CATATTTTCATATATTTAXTT^ 

2 68 65 GTIAAIA3TTTMACTCTGAAGCCA 

26957 ATGCCTTAGTCTriTTCTCTTTTAATGAGCAG^ 

27049 AICTAGCAGGTAGATATTGO^ 

27141 CTACCXXTGAACCACAXTTTTCTA^ 

27233 TAGACXnTAGG3USGGATGGGAJSGAACTTT 

27325 ACAAGCTTTTAGGTTATTTTTGCATCTAAAGC^ 

27417 GftGftAAATCAGA^ 

27509 CGAGCTCAOTAGCAGTTATTCTTGCTTT 
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GAATTGCMC1?\CATTA^^ 

TTCW3\AGTAGA^^ 

AAGCAAGTITCTITGGG^ 

AGAA1AAA^IX3\ACTGCT^ 

GCAGGWZCXXrCTTO^ 

GTTTAACATTTTTACCT 

TCTZW^TVirrCTGOTGCCO 

TT^GGCTAAGAAGAITTAAGATGCTTArrTT 

GACGCCTTTTCTGCTGTGGTTG?T GAGTTGC 

XAITTCCGS^CTCCATAGftCTTAACTCT 

GCATGAAAAATCATAAAGATG^^ 

CCCTCATCTCTGCC^CATCTCTCGGTTTTCTICCG 

CTCaCTTTCACCCTCTCCCT^^ 

TTAGCTCTCTATTGGAXGTTTCC^ 

TTTGGCAGATTTCTTTTTCCTGA^ 

CGGCCCTGCTGTT^^ 
CCCTGCCCC^^GATGCTTC 

GCTGCTGGTTGCCCKrrTTTAICTTTT^ 

ACCTGCTGTGCCTTCTCCTAGAATGCGCTriTCTCCTTTC 

CTGACGACCACCCCCTAGTTCCAACT^ 

CTGGTGTTCCTTGGGACTAGGGACTTCCTrCA!I^ 

AATTAATTAGCATCTTGTCCTTCW^ 

AGTCACAGACTCGACTXTTATTAAATCCTCT 

GATOTTCAGCrrcGATTCTGCCCCTTC^ 
TCTCTTTTGCTITCTTrGTGTTG?^ 
ACTGCTTTCITIGTGTGCTGCACACTCCC^ 
GGGCAGCTTTTAAATGIMTTGGGCACCTTTG 

AAACCAAATGTGGGAGAGAA^ 

ATTGGGMaGATTTATTGA^^ 

TGCAGTTGCCTGGGA17rCAGTGATAATTCCC(^^ 

TTrCTGTTTCCTCACAAACCTCATAAC^ 

GTAAAAIATACATAAIAIAAAAT™ 

TTACCACCATCCATTTCCAG^ 

ACTAACCTCTCHTXTACTCTGT^^ 

CTACMTTTATATATCCTTGTAAA^ 
AGCTCTgTATTrnTO^^ 

TCTTGCTCTGTCGCCXIAGGCTGGAGTGCAGTGGTGCA 

CCTCAGCCTCCCGAGTIAGCTGGGA^ 

TTAGCCAGGATGCTCTC^^ 

GCCCATGTCTTTGACCATTGTIATAAACIA^ 

TAAGTGTATAITXGCTTTTCCAIArTTT^ 

TTACAITTTCTTTTAAAGAAGCT^ 

GTGCAM'GAlTrATCTTTCAATTTGCAT^^ 

AJT^AAAAAAAAATAGGCAGAGGTT^ 

TTTG(3VrCJVGGA37\TG^^ 
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GACAGAGTTC3CTCTTAGATGCOT 
TTTTCmGTTTCCTAAAOT 

TAGCTTCTTCTrCAXTTCTCTTTT^ 

ACCTTTCTTTCTT7\CTCTATCC^ 

GGCAAATGCTCGCC&CCACCTCCCCGC^ 

CAGTATCCGAAGCAGGTGGGGAOT^GAl^ 

TTAGACCTTCTAATCCACAT^ 

TTTAATTTCCTTTCTrTCTTTCTAA 

CTAGTTTGGCACTGGGTGCTTTTT^ 

TTCraTTACTGJ^GAAAATTGXTTGCCT 

AMmXTTCAGGGGAGCMC^C^CATTGCGCTG 

GGTAAAAA3C^WTCTG?VAAAG7^3<^ 

GAATATCCTACCTGTATTT7VCTCTGAACTTT 

CAGTICICTCATCCTAAAAT^^ 

AGrrrxrTTtrrKrCTGTT7ATTTGCCTCTTGO 

CCTTGCCTX^GATGACCTGC^ 

C^J^ACATA£ATGJ\TGT^ 

GftGCTX 3CCP J5GGGGCACTTGRCTGCCACTTC 

ACATATITTIT^GTTATTTA 

(XC7X7iGCCT , C7rGAGTGAAGTTT^ 

G?\XGC££GATGGCCJVrT(^GAAAT 

CACCTCTCJa , «^CTCCATATGTTCCT^ 

AACACMC^TCTTATGAGTCA^ 

GCCCATTIACCCAGGCACATT GGCT 

U^n^^mAAACCAAGTCTCTAA 

TCTCCTCCCCCGACCGTGTATGGAACA^ 

GGCAGCTCATATTGGTGTAAAAATCACT^ 

TOTTTTTCTTCATTTTTTTA^ 

TAAAGCAGGAATTAAGGCTCAACTAIXn^ 

TTTCTTCATCnTTTCCAA^rGGTCT^ 

CCTGTGAATAS^GGTTTT7WW\GAAAAAGAAGGG^^ 

TTCn^TTTGCTCCCTTCJVrATCTCCGA^ 

TGAAATTACTGQATTTTACTCC^ 

GCTAGGTAG-3CCATTGTACCCTATAACTCATA 

GGGGGGAjS3TCCTAGATGATTCCCTGACATCTCTT^ 

TATGGTAS3AITATAAAGTTTTGCAACT 

GGTTTSVTGGCCTACAGAGTCAXACAGCTGGATTC 

CCTTCCTGTGGCTGATGGTACACACAG^ 

TGCATTTCTACAGTGATGCTGATGCTATAGC^ 

GACCCTTCAGTAAAAAAAAAAAAATJJ^ 

AAGCJCTATCACAGTACAGA^ 

GGCTAAAAGCAAATAAGCTTC^ 

TAATTCA1GTTTAGAAAGCAAGAATATAGCA3^^ 

TCAGCATAAGCTAGTGCGCTT1AGGAGCTGTGAAAGCTT 

ATGGATACCAAATAGTAATTATTAATAACTCAGTA 

GTTTGTTGCCCCTAAAGACCAAAGAAGAi^^ 

ATAAACTACTCTGCACAGACTGT 

TAGTGAGCA2CTAAG<^GGCTOT 

AGAAGGTGCTQ\AGAATGCCAAGCAGGC^ 
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36801 ATACCTTTTTW^CGATAX 

3 6$ 93 AACTQV^AAATACATAAGCCCAGMTTTGAAA 

3 g g 8 5 TGGCAGGGTGCACGGTAGTGTTAGCTG3AGAGCT 

37 077 cTGGGGaX3VTCTTTATCTCTT^ 

37169 CTATCTAAATTGATTTACTTGTTC 

37261 TAATTAGACTTCTCATAGAAACT 

37 353 eacftiTGXTTAAG^ 

37445 ACATGCTCAAGTTACTTGCAGACTTTCGGA^ 

37537 ^ftCAGGGGAGCACTCACTT^ 

37629 GATGGC^TTGJU^AATCAGA 

37721 AGCCTATGGTGGGGAGGAAACAGTT^^ 

37813 GACAAATTCAAGTCGTGCCAAAGAGATACGATG^ 

3790S CTGGAAC7TTTCTGGC3\CCAGTTGTTG 

37997 TGTCTAAATTCCCTTCTTAGAACACTACT 

38069 TTTCCAAATAGCTCACAXTTATAGG^^ 

38181 GGGAGCAAGGAATGTGT GCAAC7VTCACAGGGC C 

38273 TGCTGCTACATCCTAAAGAGTTC 

38 3 65 (^XiAATGTGAAAGAGCTGAGGGC^ 
38457 T Ol 1 1 i i I rm U Tl T J GTOCMMCHtf^ 
38549 TTACCTTTTTCTTATCrGTCTTCAtn^ 
38 64 1 TTCCTAATCAGAATCCAATGACCTC 
38733 GAAAGGAITCTCAAGGTCT^ 

38825 AACCTCAGTTTTGCTAACATGCAGCTAA^^ 

38917 GCTCACGTAGGGAATTGGAGAAGGGGGAGAGGAGGATACT 

39009 GGTGCCCCAAATTCCCAGTCACTATCTGACAGTTTTATG^ 

39101 CTCITATTCATGAAAGACTAAA^ 

39193 CATTTTAGAAATTTTGAQAAA 

39285 y xw^xuVlU U ' I ' r TTGM^ 

39377 irCAAGCGATIXTTCCTGCCTCAGCCTCCCA^ 

394 gg TGGGGITTCACCATAITGGCCAGGCTGATCTCAA 

39561 GAGCCAO^rcCCCAGCCTAGGGGGGA^ ( 

39653 ACTAAAIATGTAATATTTCCCTGCACAA^ 

39745 TTAXAATAGTCCATTGAGATAQ^CATA^ 

39837 GTCGCCAC^rTGGAGTGCAGTGGCACC^ 

39929 AGCTGGGACTCCTGACTAGCTG5\GTAGCGCATGC^ 

40021 TACTAAATATCTCACCATCTTGCCCJ^GGCTGGTCTCA^ 

40113 AGGTGTGJ^CACCATGCCCAGCCATTTT^ 

40205 AAAAAATCTTCrCTATTGGTACTGGGXAATATAlTATCArACA^ 

40297 ATTTCATTCTTACTAATTTTTTC^ 

40389 TAACCATGAGGTGGGTCTGCGTGTACTTACX^AAACT 

40481 AAGAAAAT ATATATTTC TTTTCT^ 

40573 CXiATATATGC CAAAT AATCAXGGTTA^ 

40665 TGAACTTTTAAAAAACTAACACTTTTATAT^ 

40757 TTACTTCCTA^TGGTTAGCTC 

40849 GGGAGTGGGGCTTCACATGGTCACTAATTTGA 

40941 GAGTTCTCTTCTGCACACCAJCT 

41033 AGJW^CTGAAAACAGGGATC^^ 

41125 TTG<7IXK:CCACTCCCCGTATTCTTCACAACAa 

4 1217 CTTTCCtCTGTAOMTrTA^ 

41309 TCTGAAAATGTTATTTAATAATGCATC3W>lAAA 
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41401 CTTTTICCCTTGCAATAATTACCCJ\GSG^TATGTTC CAAGAITX^GTAAG?\AAGCGATTCTGTC C GOTTU^ATGATATTGCTAACATTTTATA 

41493 ps^w3v3J&^^ 

41585 TGCGAAGACATGAAGGC&Aft 

4 1 677 £13\CTACTAAAGTTTTTGCGTM 

4 1769 GATITCGCTCTGGTTACLACT 

4 18 61 TCTGCCCAAATTGTGTCCCCT^ 

4 1953 xcTGAGCXI^ACTCTACTCATCTTTCT^ 

42045 J^CGCCTCCTCTTTTTAAGGG^ 

4 2 137 TCCCACGAXGAATACTGATCTC^ 

42229 GTGATTGCCCTCJ^ATGCCGTAAGT^JGCTTACAETGT 

42321 ATGGTCCTG7\GTACAIACMTGCAGC^^ 

42413 TGAAACAAAAAT7VTCTC7J\7\TTTCT 

42505 AACAOuSTTAGAAGATTAAAC 

42597 AGGAAGGGAAGACCGGCTCK^G?^^ 

42689 CCTCTCTGQ3JAAITGTTGACT 

42781 ACACAGAGACAGGAGCAGTTC 

42873 TCATGITCAAGGAXTATTTTATAAAT^ 

42965 AAAGAAITCAGTTTGrrGrrAACTy^GCW 

43057 TCTAAXAAACCAGATTCAACATAAACC^ 

43149 ACTTTTAAAATXCCATGTC^TACATJ^ 

43241 cTAGAAGTAt^ATCTCTCAGCAG^ 

43333 cCAGTGAAGATCATTTAATTAAA^ 

43425 GAGTGATCTlAAAAArGGTGAXGAAGT^GCCTGr^ 

43517 TTATAITrAA?\AGTATAATTTGTAATAAA 
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hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 


SF FCESLFI I MN FANSDRPMKATAFPAEVKDLTKRI RTVLMATAQ3C4KEHEKDPEML I DLQ 
EHLRRSLRT I LAYSEEDTAMQMTPFPTQVEELLCNLNSILYDTVKMREFQEDPEMLMDLM 

EFi^-R£TJCT-I-tsTYAEE-DLE-LRE^ 

TR FCOS LS 1 1 NNCANSDRLIKHTS FSSDVKDLTKRIRTVLMATACWKEHENDPEMLVDLQ 
F H : JIR5 LKT I LTYAEEDMGLRDST FAEQVQDLMFNLHMI LTDXVKMKEHQEDPEML I DLM 
SPFQHS LAI TNK FANGDKQMKNSN FPAE VKDLTKR I RTVLMATAQJ4KEHEKDPEMLVDLQ 


Y£ I.AKS YASTFELRKTWLDSMAKIHVKNGI 

YF I AKS YQASFDLRLTWLQNMAEKHTKKKC YTEAAMCLVHAAALVAEYlpMLEDH- 
YR I AKGYQTSFE - RLTWLQNMAGKHSERSh 
YS LAK5 YA5TFELRKTWLDSMARI HVKNGI 

YRIAPGYQGSFD LRLTWLQNMAGKHAELGb HAEAAQCMVHAAALVAE YIjALLE DQ- 
YSLANSYASTFELRRTWLESMAKIHARNGI 


•* * * 


transmembrane 


FS EAAMC YVHVAALV AE Fl HRKK 


HAEAAQCLVHSAALVAEYI SMLEDR — 

LS EAAMC YVHVTALVAEY1 TRKG 


LSEAAMCYIHIAALIAEY1 KRKGYWKVEKI 


1539 
1543 

lOOS 

1551 
1582 
1679 


1592 
1598 
1718 
16j04 
1637 
1739 


LFPNGCSAFKKITPNIDEEGAMKEDAGMMD 1622 

S YLPVGSVS FQNISSNVLEESWSEDTLSPDEDGV 1 633 

KYLPVGCVT FQNI SSHVLEESAVSDDWS PDEEG I 1753 

VFRQGCTAFRVITPNIDEEASMMEDVGMQD 1634 

RHLPVGCVS FQNI SSNVLEESAI SDDI LSPDEEGF 1 67 2 

CTASLLSEDTHPCDSNSLLTTPSGGSMFSMGWPAFLSITPNIKEEGAAKEDSGMHD 1795 


IT AM 


VHYSEEVLLELLEQCVDGLWKAERYEI ISEISKLIVPIYEKRREFEKLTQV YRT1 HG 

CAGOYFTESGLVGLLEQAAELFSTGGLYETVNEVYKLVIPILEAHREFRKLTLT HSKI QR 
CSGKYFTESGLVGLLEQAAASFSMAGKYEAVNEVYKVLIPIHEANRDAKKLSTIHGKIQE 

VH FNEDVLMELLEQCADGLWKAERYELIAPI YTCLI I PIYEKRR 

CSGKH FTELGLVGLLEQAAGYFTMGGLYEAVNEVYKNLI PILEAHRDYKKLAAV HGK1 QE 
TPYNENI LVEQLYMCGEFLWKSERYELIADVNKPI IAVFEKQRDFKKLSDL YYDI HR 


1679 
1693 
1813 
1677 
1732 
1852 


DOCK motif 


1 FFEDEC GKE^ I YKEE KLTPLSEISQRLLKI YS 

FT K IMH QS S GWEKV FGljY FR^GEfYG- A H FGDLI EQE I V YKEE S I TKLAEI SHRLEEI YT 
YLKYAEWNSEKRLFGFYYR^ AF YGQG FFEEEE GKETi I YKE E KLTGLS E I S QRLLK1 YA 


•L: 


ITAM DOCK motif 

iYllq LEVMHTKKRLU51|FFR^ 
P FDSljVNKDH — KRMFGTIyFRV GE FG-SKFGDLIEQEE"V YKEE AITKLPEISHRLEAi YG 
AFSKIVHQSTGWERMFGTtYFRVGEYG-lKFGDLr EQEEV YKEE AITKLAEISHRLEGI YG 


ITAM 


* * *+* *4 


1739 
1750 
1872 
1710 
1791 
1912 
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hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 


hCIiASP4 
hCI*ASP5 

HCLASP3 
hCLASP2 
hCLASP7 
hCIASPl 


hCI>ASP4 
hCLASPS 
hCIASP3 
hCIASF2 
hCUVSP7 
hCLASPl 


hCLASP4 
hClASPS 
hCLASP3 
hCLASP2 
HC1ASP7 
hCLASPl 


hCLASP4 
hCLASPS 
hCIASP3 
hCLASP2 
hCLASP7 
hCLASPl 


IT AM r IT A M 

O^FGTENVKIIQDSDKVNAKELDPf YAHIQ\hfrvKBYFDDKELTERKTEFERNHNISRFV 


QC FGA£ FVE V I KD S T P VDKTKLDP* KAYIQI T|EVEljYFT>EYEMKDRVTYFEKNFNLRRFM 
EP FGEDWEVIKDSWPVDKCKLDP1 KAYI Ql 

DKFGSENVKMIQDSGKVNPKDLDSJYAYIQ\ TtiVI^FFDEKELQERKTEFERSHNIRRFM 
EP FGDDWE I IKDS YPVDKSKLDS< KAYIQI 


DKFGADNVKIIQDSNKVNPKDLDPJ YAYIQ\ T YVTI FFEEKEIEDRKTDFEMHHNINRFV 1972 


1799 
1810 

YFDTYEMKDRITYFDKNYNLRRFM 1932 

1770 

ItYVEflYFDTYELKDRVTYFDRNYGLRTFL 1851 


* * * • 


* » ■ * * * 


IT AM 


DOCK motif 


FEAPYTLSGKKQGCIEEQCKRRTILTTSNSFP VVKI* RI PINCEQQ^ LKPIDGATDEIKD 
YTT P FTLEGRPRGELHEQYRRNTVLTTMHAFF RISVIQKEEFV LTPIEVAIEDMKK 
YCTPFTLDGRAHGELHEQFKRKTILTTSHAFF YIK1 RVNVTHKEE I ] LTPIEVAIEEMQK 
FEMP FTQTGKRQGGVEEQCKRRT I LTAJ HCFP YVKb RI PVMYQHHTI LNP IE VAI DEMSK 
FCTPFTPDGRAHGELPEQHKRKTLLSTDHAFF RIRVCHREET\ LTPVEVAI EDMQK 
FETPFTLSGKKHGGVMQCKRRTILTTSHLFPrVKlRIQVISQSSTI LNPIEVAIDEMSR 


* •* ♦ + 


Coiled-coil 


KTAELQKLCSSTDVDMIQLQLKLQC WVSVQVNAGPIAYARAFLNDSQASKYPPKKVSELK 
KTLQLAVAINQEPPDAKMLQMVLQC SVGATVNQGPLEVAQVFLAE I PADPKLYRHHNKLR 
KTQELAFATHQDPADPKMLQMVLQC SVGTTVNQGPLEVAQVFLSEIPSDPKLFRHHNKLR 
KVAELRQLCSSAEVDMIKLQLKLQC SVSVQVNAGPLAYABAFLDDTNTKRYPDNKVKLLK 
KT RELAFAT EQDP PDAKMLQMVLQC SVGPTVNQGPLEVAQVFLAE I PEDPKLFRHHNKLR 
KVSELNQLCTMEEVDMISLQLKLQC SVSVKVNAGPMAYARAFLEETNAKKYPDNQVKLLK 


1859 
1870 
1992 
1830 
1911 
2032 


1919 
1930 
2052 
1890 
1971 
2092 


Coiled-coil 


DM FRKFI QACS 1 ALELNERL I KEDQVE YHEGLKSN FRDMVKELS DI I HEQ I LQEDTMHS P 1979 
LCFKEFIMRCGSAVEKNKRLITADQREYQQELKKNYNKLKENLRP^ IERKIPELYKPIFR 1990 

LCFKDFTKRCE1>ALRKNKSLIGPVQKEYQRELGKLSSP 2090 

EVFRQFVEACGCfALAVNERLIKEDQLEYQEEMKANYREMAKELSEI 4HEQICPLEEKTS- 1949 
LCFKD FCKKCE I >ALRKNKAL I GPDQKE YHRELERN YCRLREALQPL LTQRLPQLMAPT P - 2030 
EIFRQFADACGOALDVNERLIKEDQLEYQEELRSHYKDMLSELSTV tfNEQITGRDDLSKR 2152 


PDZ^ 

WMSNTLHVFCAISGTSSDRGYGSPF 
VESQKRDSFHRSSFRKCETQIiSOGS 


liaand 

]— 2008 
- 2015 


VLPNSLHIFNAISGTPTSTMVHGMTS dSSVvl 1980 

— PGLRNS LNRAS FRKADL - — 2047 

GVDQTCTRVISKATPALPTVSIS dSAEV4 — 2180 
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